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FILE 'HOME' ENTERED AT 10:15:12 ON 12 MAR 2004 

FILE 'MEDLINE, BIOSIS, CAPLUS , EMBASE, SCISEARCH' ENTERED AT 10:15:26 ON 
12 MAR 2004 



LI 6786 S MSI OR MICORSATELLITE INSTABILITY 

L2 989 S RIZ1 OR RIZ OR RETINOBLASTOMA PROTEIN INTERACTING ZINC 

L3 21 S LI (P) L2 

L4 8 DUP REM L3 (13 DUPLICATES REMOVED) 



FILE ' STNGUIDE ' ENTERED AT 10:17:23 ON 12 MAR 2004 

FILE ' STNGUIDE ' ENTERED AT 10:22:08 ON 12 MAR 2004 

FILE 'MEDLINE, BIOSIS, CAPLUS, EMBASE, SCISEARCH' ENTERED AT 10:22:14 ON 
12 MAR 2004 

L5 153 S LI (P) (POLYA?) 
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L7 1 S L6 AND L2 
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ft*ftJ£(MSI)PBttW« 

[57]«« 

RIZl DNA#?j$fAS] MSI PHt£JW® t f ,#£ W 

®*^J£##fli!l#«£-fcW wzi SflMo Ent*£ 
ijJit#-#$£W® MSi#£to#ft 0 a-^r&iiil 

rizi m m&®*£&&®®*&^to®3*® 

feWmMftft MSI PBtto Rizi £0+ #&#iE1!!'&B 
MffiHSI^SIiJfcWJt* msi BH4„ 
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00 
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ft m m * =& 



u -wmmim msi PBttw*w^j*. rizi £ttw«» 

2, ttfiis* i j?fii*w^tt. fittis riz £g a a 

3* ftjpJH* 2 JJf3f#flWi*. K4> RIZ £^ A ft RIZ (A) 8 fffll 
4, ftfd«$ 2 j|f**W^fc. RIZ £i£ A ft££ RIZ (A) 9 Jffi 

7, jfctfffl* l 0BS#«tifft. ItMl^illit^tiii Rizi # 

8, ftfijfi* l Jlf»*W5Eri*. &4 1 Rizi £tt*W-5tffll^ 99 *W 95% 

io> ftjpjg* 8 gfww^rtt, s+ rizi 4 w 80% m 

IK 10 RIZI £ttfifilW5 4 

12, i fflf*W5firft. S+ttKffWtt^-^***^*. 

13> Rfl5#i2 0r5*M2rtt. a+«#*#*-^l***ft#- 

15, ftjpJH* l J?f5l*W^rfe. MSI RttttflAtMHSMf. W*. 

16, ftspJS* 15 ffWtt^rfe. msi nttMM££$ Artttttft 

17, MSI PHttttSW^ft. MliiiiStl^ RIZ S@±£ 
X A ftj&JJMn*ttftS*&£ MSI RIZ *H±+*iE«»Btt^* A 
^ifcJM^MSI PBtt. 

18, tt*|3l* 17 0fg*W2F*fc. RIZ £^ A RIZ (A) 8 
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4? $ 




19, ftftg* 17 J|f**M;j5rtt. RIZ £X A RIZ (A) 9 Jf^iJ 

to 

23, ft^J5*22mH*W^rfe, MSI Rtttt*MtHfi$AMttitff 
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m m =u 



(riz) m®mTtkmmmzm&w*%fe cmsd nem 

SJfefe^tt^PBtt (MSI + ) frHS. 

j&fflHJ$3llfl&0*g& (Retinoblastoma RB) ffl^$^f(Zinc Finger) 
(RIZ) £-*»£WW*ffl*!l*B. PR (PRDI-BFI-RIZ-1) 

°£ SET (Suvar3-9, Enhancer-of zeste, T_rithorax) 

PR/SET (x^^A^flt^^PM^ffl, S^tt^^l^^ 

ita^wii^tiMo riz pr E««£££aMffifftiw&£#flB 

riz SEI-MM^SM/^. RIZl ft RIZ2, glS^J^iCS^^W 
PR lE^o RIZl PR E#) ^ti^^il-ttf 1 ^m&Wm^ 36 IE 

d P 36) mm&]ffimmmm®o ip36 e^a^+-#w±w^4 i w^ 

^^M^MMft^ RIZl RIZ2 SSTO^iio 

ffF*. MMft. »tt£rt#»£3EW* (He $, Cancer Research. 
58:4238-4244, 1998; Jiang ^, Int. J. Cancer. 83:541-544, 1999) . & 

*Mft#t££ rizi mm&tt, w rizi ss^^issajig^tti. £ 

i^MMl 1 , RIZl G2/M fi^Pl^T, *ffl 

IX, Jfcft#^»^tt (CIN) »JBfi»!fe#^iljett (MSI) o CIN ^±^#tiE^ 
ififtfe#^il^tttlli)l***^fllll6rttt*«fll«« (DNA) 

n&m, itimn. msi PHttttw*»*«i«^is»!fe#fia^j«^$ 



•••• 



Adnull £tfjffitttf>tt f WiRJSJ&fr RIZ1, p53,5£ Apoptag mMffltim&fem 
(TUNEL ) o 

riz mm dna #£M4>^&£^itt riz zmmmm*. 
ms^Bj^ii^j^M^w^, rizi ^J3tt^rtfp#^«j msi mmm 
w»*^. riz A ) msi mmm 

&mm%Miimm&tt*ufe&mmmmmm msi preM^^M 

MSI PBttJflMW 40%W±*ai±fl<iMjfefe#liJ3a^* t 60%^±M 

riz mm^nmpk^wiMim^o ®m&T&usimmm*, rizi ^ 
WA^^ihJmitmm, ^f^m^m rizi 

mm^&ftMo Est^m^'-mm^m^ msi pstt^j^^ c j?el 

Boland ^, Cancer Res. 58:5248-5257, 1998) „ M J.lt^±^l^^^|ffni 
Mlfc&ft^ltlo iift&ififiS Boland ^£±i£»*iliijftfet!*&, £ 
MMgft BAT25 (gBJf Genbank 9834508) BAT26 (SEJf 

Genbank 9834505) , ^^Sgftj D5S346 Genbank Jf^iJ 

181171) , D2S123 (mmm Genbank j^ij 187953) , *D D17S250 (g@J$ 
Genbank Jl^iJ 177030) . %t£mm* : fcft?W : $&&, £fl Boland ^£±j££ 
jtK^i'Jtfj, Mao Proc. Natl. Acad. Sci. USA 91:9871-9875, 1994 

msi rattans* m*mm'>frikto*kim) , 
w#mm, &mm, nm, mum, ttftmm, tmhs. m 
m, &m crfttt&aj) , 'bmmmm, #<mmmm, fttttt#mi cj»# 

MMfotil) ^ (JSLClaij^U$, Exp. Cell. Res. 246:1-10, 1999) . 

^»*«tt'W#J&Wtt^K« (HNPCC) &#«ia»»fe#^a€tt. 

hnpcc ^-f+^^ttM, &&fgmm&j 10%. hnpcc M±g#jS£jH«*g££ 
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&7Sfc£££$*h DNA fi@e{#g£&£flffltm-f7tfJ DNA 
#J*tto feTZWikfiMFu, W^ikfoRM, %\ N-methyl-N'-nitro- 
ni'troguanidine, N-methyl-N-nitrosourea, Streptozocin, temozolonide, 
decarbazine; ^tfci^J, #P Cisplatin Carboplatin; WM^$k%Q, £q 
6-thioguanine; RQfofafj&M, tl\ busulfan, etoposide, |P doxorubin ^ 

®%mmmm#jt&mm^jEif;&}%nmn%%i, msi PHttWiifM 

JESfflff fl5^. > htt*«ift«FW'«#P«aaiH P53 RB SOTJJ3i£$f 
«#5K»^#«##Ai!l ! «rW±jiSH!fi!$Wll l l'0 c t» (JJL Roth «, Oncology, 
13S5: 148-153, 1999) . j$M& MSI PHttWfl+S^&W RB l£ P53 $@ftpg, 

@jJtW&gJF£ft*7 MSI PBtoJgWSa?S?f ^rfe, #»-#oJliift& 

pj msi nttjMrtt&ia, a^*i»wi»i6ttii^rE*ttfe&^3E^jtw. 

*£E*#-#fflflOa»!fe#*ftett (MSI) rettW*g£-feW2rft. & 
-^fe«W«WiM^J RIZ1 MM DNA J??!l&A3J MSI PBttJJW'f. 

M^&^mw^w rizi g&Mo 

ran*jik»9m«-ft«l3feW* msi rizi 0f 

&£&*£J&m£^+&m»*to&E*WfeMl&Jk&% msi PHtt. rizi s@+ 

^mmnm^mm^mmmm^ msi ratt. 

S 1 ^AM RIZ £&M&£$JKtf (#VR^ hRIZ; SEQ ID No. 4) ^A 
§(, RIZ leiSSilftf (**i n Jj rRIZ; SEQ ID No. 2) tS^tttfc^o 

® 2 £$m#W3W|fMWJffl AdRIZl M^^^JlM^fe^^m^ 
ttPBtt (MSI+) WSrlR0atf*Hf&?f . RIZI £B$$fflA££lM 

HCT116 ^M^ilJ^Ha^ijm^, ^5frmj, $W Afff KlKR WMA'hi^T 

GMISMB. ^4 , ^W»rtsKW*^ffl^j3iJft*f«?»|'«[ (PBS) , xtfiaSfittJS 
(Adnull) p53 SSM^fH* (Adp53) , ^ RIZ 11^1 

mmw (AdRizn o ^^jm^j^^t; sxio 10 mmmn. tv 

ffl 3 M#f#^ftflt;lf M«t>it4T RIZI, p53, *D*|!IflMt:MfcM*R 
tti3*lfefe#0r. HCT116 MMft&tt AdRIZl, Adp53, 



amjifatt»«i*i , *4-ife*»*«w , *a*^3tt. awtt*ai»4^ 

«sipjiii«w*4ifeRrwifi»raft^««. «»y 1 0?*. nm® rizi 
msi wttw*an&^fflJHffB« g2/m us, ^ta*aifi*s«rw 

RIZ £B$XM*£ft#3lt&ffi%$. 

^«WRlZi«iiliBttMSIPBttW**^- ftlt^A 1 * 1 , «2tt*rffi& 

(PSA) , #££J3W«£iMB*WW«##ttiS. WHMtttf^K-lR^aiE 

*nT£«'J i lP$Jfiffil 3 0pf, £ msi Pfittlt^lfitt^iii'l 3 , ##J^ 
£W RIZ g@£IM«St0?'Jf&ft^tf] MSI WWt. *fc*HRtt. 
RIZI ^?ltlWt, Mfflte^UWteil^ DNA 

Apoptag TaSftJ, £ Intergen . frl^W'J 1 flr/J^Mt^^ 

(FACS) ft FACS a-tfr+^fttote? Gl ffl DNA * 

IWHSfl^itfiMtMiM DNA 

♦SHfiJWft MSI PBtttt*+5jX»a»RlZl ^IkttttK^. #*J£ 
W$ftJ* RIZI fie**»>f W*. ^MffltfJ RIZI ^tt^AW RIZI 
J¥?iJft£l*o RIZI £tt#J&iSM0rfll 4 (SEQ ID No. 4) «#4lflgttft 

^g|W|, flR5£ff-+*«*IS5tt*fl. RSifttfflAWHHB MSI Rltttll 

MUMP. M4>» Mtt, fi*W»J**K»«JH'««tt*afi4-lfettatt. 
^itWA^ RIZ *B*fll (Jlrfll 3 4 Wfll+ttftMM 5 

a*w*R riz ssm^j mm 1 tfffl) «fc 2 

?|J ; *R RIZ S@WJ (104 Wfl|> BftMH^J^ 6069231, 5831108, ft 
5811304 EUt£ Buyse '3? Proc. Natl. Acad. Sci. USA 92:4467-4471, 1995 



£b Mj&Ms^ 75%W±, 80%, 85%, 90%, 95%, 98%, 99%, WL%£fan 

zfto aaww^ft^^-ftwaatt+^ftw riz g@o sng-^, aw 
g%*mmxm, mm^*M&&&tonmwm&-+teftmi riz 

ftS. IftftB, *E*C», feM®, fl»*«*#tt*S*L 

iiw^stttii(^itti. &m mk. asm ^h#rk«« 

iflffl^llflt^ii, JS««»£ftMC«. tf#E«l«ft»Mft. 

ia^w. ft^EffMfiffi. i^Rfatt^. ^^.mrnmi gst jm 

xt#ai 4 + w riz i ^mmmm^iEtmn 4 w^fe#iEt# 
$p^j 102 m 103 +sf^ rizi &=.^&&&&<ko R-^yn&WiEisB-w 

$mto% T969 A969069 &ftMMl£3^MW) . 283 
&D«Eo T969 ^frttTUt A969 ?>J 4 "fltt^-^ 

Sfefe#iEtt#fl* 704 t^iiSii^. 

4— ftfitfflt. ffltt+^Wttffi^rftftflttft^iiW Rizi£tt*#fl|4 

ft PR E«ffl#(jfc#»ftfcJ*fll 99), 8fci|ttJ*flAW 95%W±ffiIil^if 

n, in*SlttRiziPRE#&riG. 

S RIZ l MS^W-^ 100 PR PRD1- 

BFl jfP Blimp-1 42%^^«o PR E*^Evi-l gfiftN 

-iffi (JB, Morishita^, cell 54:831-840, 1998; Morishita 
Oncogene, 5: 936-971, 1990) • fP C. Elegans (ai'h&A) (ft egl~43 §6#J 
N-iB (JUL Garriga m, Genes Devel. 7:2097-2109, 1993 13) Wl^f 



PR E^BME^ RIZl ftTHftftUBflfjfiMR (B He Cancer Res. 58: 

4238-4244, 1998) , #^#-5&fc#fi#M#^«^«l#J <Xie J. 
Biol. Chem. 272:26360-26366, 1997: Huang ^ , J. Biol. Chem. 
273:15933-15939, 1998) . 

PR E«»Jt 100-120 ^M»,-WH^*««9Wfi[jSi, A, B, 
C «M^W 10-12 >MUM, + 20-35 >MU£»#JF . A, B, C $ 
SMIfiK. AKWRIZ1 PR EWW 2 36-151) ft A* RIZ 
PR E^OWJ 4 * 37-152, iflWJiJ 99)S*ffi|iJ, R£AW£ 70 &MS&± 

&JMCK, «EfBI*]fta±(69 ft), ^IftiDMiK. DNA Mr+# 

mttn^R pr e$ b ft c aKtffli* 104 ft 105) j¥m*mwMum®Mf¥ w 

tffll 105 <f»Mtfr 75 J5A& RIZl <£ PR E*(IPA£ RIZl J*BI 99 + 

42-1 16) 1^1 o 

RIZl ^ttaWA^WaWlfe^r. E1A ftNBS&ffifeltt RB 

,6; 8^3^^: MPR^E^. PR E C-Jft PR g^E^cW 

ffiS^RTSB* RIZl ^AR#A#^SIRtt^li. 

rizi $&nu mis n^mm SHftftgjig. 
$p«^j 1 ft&itffl 3 m**, — MB*mwf*^ msi nttttAiBtt^n riz 1 

^tt***«HW mis RttttA^mtFtt. Sifc. ^-tlWll^N^- 

»4j-jg#jiittPr«« riz £tttott»j*fli#A msi nttftMi msi rattflt^^ffl 
rjww^ riz ^ttttttK^wsAaawftflc+ffl^Ma^^t 

^SW*#fifi. inifi^f:** (Jl*J«|S6WMuLV, 

HTLV, ft HIV «#) , fttt£ffi*tttt. ft-feft 

£P#3^@£I> DNA, |fl»iS#Ilii| (#JHL Kaplitt IP Loewy, 
Viral Vectors. Gene Therapy and Neuroscience Applications. Acad. 
Press, San Diego, CA 1995; Chang, Somatic Gene Theraoy, CRC Press, 
Boca Raton, FL, 1995) . J£*a*itt«[«W^j*4Rlt:+ar*fftt. 

Bmmmwztmj&$mT&% P53 ^^wjssi'JBt 

* + , AWffc#tt> i^7K¥S@^Ji, ftM&-J7$3l (JUL Swisher ^, J. 
Natl. Cancer. Inst. 91:763-771, 1999; Clayman, Seminars in Oncology 
2788:39-43, 2000) . 
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?mm®?®ij&m*&ftfeMm®*m&m® riz mm^mt (*n Dai 

Proc. Natl. Acad. Sci. 89:10892-10895, 1992) . 5£#^ RIZ SEftjM 

^IClfiHt^^ (fclttft. tt£tt£tt, i£ HIV Tat §SM^) o 

Anderson, Nature 392:25-30, 1998c 

mm riz ^ttWttKnrw#ftj4a»Aa«A#w^M4w*+*ii*» 
^jsaww^aa. Manama, flwsA^^M^TiHiiifi 

j^j^«A{*3§SJJl+*&&£ (in Swisher^, Supra, 1999) . J^SMIAltt 

dna ^A^astttesi riz &iktofflmAMimmti.&nG 

MSI Btt#fiEtt2fft. £^&M£fl«r*f 

MSI PSttc 

m, #ft£JR»*i»MW50tt. msi 

»f\ &W&»RIZ £XllMi*tt&&$. Sifc^iJJiJM^W RIZ £ 

Hi?*J»i«i^ft^$«**fil!*jH:llt** MSI PB Wl. 

ift&tti rizi cdna iwsas^fi? 8 g«imft#i^xMi4 

(PolyA) gteWfifc^E. -^£iWJ 3 M 4393-4400 + A8, 
£0?|J 3 W 4582-4590 + W A9» RIZ A8 £»l^ff?iJ&^WfWi. 

C-^W±^, jlkttflM8[*5gftiaWBt«¥¥»>r^- A9 JWJ£lfc|i] 
-*?¥*l#JTiSI. jfc&SMfc&aM&ifefc RIZ C-*g PR E#£<£&j£. 

riz as §k A9 ff^j+jftw^^w^ttSRitfwrat+refti 

ft. ®m, pcr r* Riz dna, riz tt& dna, fcjgatfr^HSWK^ 
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fitt (SSCP) ##T, SRMStfr DNA £lfcfc«#ij 1 *fl«#J 2 "fWAi* 

ttifi dna #i&*#Mafea5mH'*. inmwMR. mn&*mi3to&& 

ifirflnfc. msi PHttflt«bk msi RttmxtXSft^M. S«^^7 

tmfcttm&tt msi PBttmM. a#*«i^«#*#»jr**. m&m« 

5£tt, MSI RIZl SHffrfflft. Stt RIZ »@^pT# 

£JM K * MSI mm®* RIZl SBg&WNR RIZ £B*&tiffUS 

fi*&*fl*i»££ftfniftMrto MSI rattW»+#£Wrt«*tt RIZl £ 

mm a jJt^wij^Bjs^w riz mm&m a ffw*ft*5s 

w msi mmm*%Mm&m% riz ^tt^tt^mtt^maiAxiffiiflWr 

jfDtt*»MI£:. 

m®M&%\mm& mini 22 ^ msi w mmmm, s 

M»ttffi£WJfefe#*fc£tt CCIN) «S, l ^&£#M^tB4P 

W#i3/£ D1S228 (32.4 Jfcftjg (cm) ) ft D1S507 (36.2 era) ±fc5fc#28tt 
(LOH) (Canzian ^, Cancer Res. 56:3331-3337, 1996) . £ 14 -MtffiSB 

i&£ i ^Wt&wm loh iha^a. 3 ^ msi mmmmmw&mmummm 

(ATCC) &= > M*£MDAMB231, MDAMD435S, #1 SKBR3 (?M& 

ffi) . WltJ^^Sil^^^J^t RIZ ftltii (He «, Cancer Res. 58:4238- 
4244, 1998) . 

s*HPFft7 24 wjmmim&tiL&Bii* (hnpco iaiim msi 

PBttfltTf (Aaltoren 3 s , N. Eng. J. Med. 338:1481-1487, 1998) , EflFSfc 
T 11 ^* ATCC ^ftM MSI PHttM*ffl«o tinm DLD1, LS411N, 
SW48, HCT116, LoVo, HCT15, #1 LS174T(£J©Jg) ; MDAH2774, *P SK-0V3(#P 
JM)t AN3CA(^Sft*l*)i »DU145(i!iF^lRjfi). 
frfcSWt (LOH) ftM m%%M?ftZM.&&W1i&ftffiiE%ftfe&M®® 

Mrto ^mpfn&m®$mWife®m (http://www.gdb.org) . :nm&fe# 

r»*#*: ft 15 ul 10 ng ffljSttt»ife# DNA, 8 pMol 

(5' Slttff&fttiHB) , 100 nM iftMffimWSeM ( dNTPs ) ,0.6 #■ 
® (U) DNA IR-^BI (AmpliTaq Gold, PE Biosystems, Foster City, CA) , 
10 mM Tris.HCL (pH 8.3), 50 mM Hifcfl (KC1) , 2 mM (MgCl 2 ) . 

PCR ^£ 377XL ff?IJ#0Tft±StfT4Nff (PE Biosystems) . Vk&fo&Wfo*. 
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'h#l9<3fcfM& GENESCAN ft GENOTYPER (PE Biosystems) &%]feMU & 
^^^fe#^-#W?SStfc^J^^iPoIit# LOH ffimfe (Canzian Cancer 
Res. 56:3331-3337, 1996) „ &3l-^#p a B&&##ftfttfc#!MNT 0 . 67 sK* 
f 1.5, »Jttttfc#£ LOHo 

fi^fi^ HNPCC MSI ffitt, ^-ttffK^fttt (SNPs) afjfeWffl 

*##r 24 HNPCC jEfltlPW* DNA 3I4WJiJe& A3H SNP gfc$g£$*f 

(http://www.genome.wi.rait.deu/snp/human/index.html) « SNP 
^J: £ 25 ul jg/£$*W 20 ng fl£Ztfj&£#DNA, 8 nMol *BJ&ft3l 
m, 100 nM MlfflM&& : g : $ = «f® ( dNTPs ) ,1.0 (U) DNA 

(AmpliTaq Gold, PE Biosystems, Foster City, CA) , 10 mM Tris. HCL (pH 
8.3), 50 mM (KC1) , 2 mM iCtt.il (MgCl 2 ) . PCR /^g&K^SJK 

i mm&mmtMttmjs , s&ffl&^-^iM^Mrfo dna a*, dna 

Sltf. /jfWSI^JtP^W M13 fflR] (TGTAAAACGACGGCCAGT,ff?iJ-§- 107) fP M13 
M ( CAGGAAACAGCTATGACC, 108) J??iJ. PCR £ 25 ul 

20 ng DNA, 8 nMol *Bj£ft3ltt. 100 nM WfflMft^ 

fiHii (dNTPs) ,1.0 (U) DNA fg£g§ (AmpliTaq Gold, PE 

Biosystems, Foster City, CA) , 10 mM Tris. HCL (pH 8.3), 50 mM HiW 
(KC1) , 2 mM (MgCl 2 ) . PCR j*tt&gft*Mflll 1 fffFltttm^K 

£tgj5, Igffllt-^IW^I^DNAliE 

^B^ifrftflf fflT^&tfJ RNA 5feffi RNAease t^??iJ^ (Qiagen, Chatsworth, 
CA) ttJIo Jx$£f:Btffl Superscript RT(Life Technologys, Rockville, MD), 
^jgffi GeneAmp Gold RNA PCR (PE Biosystems) cDNA rif. ffl 

RIZ1 IP RIZ2 tt#£tt3l<toBWflt*£ <* He «, Supra, 1998) . XJJH 
ft 3-^^-M-^^^iLK (GADPH) (Leygue ^, Cancer Res. 

58:1348-1352, 1998) . 

ftJgfflffiftfeftgftttfrgftflT tl^Buyse Proc. Natl. Acad. Sci. 
92:4467-4471 (1995) * ffi&m&mf ft ftffio ffltf 

RIZ2 iMNtffl RIZ ^#JflLyf 1637 £&&flrJM (*NR) ttlBftWMMMr 
&&#L&« 5%#J SDS ^MSf RIZ2 N-^ft RIZ 

fct#Kii?jt KGSE airfeaE»^«i*f 5S^«f - 

Riz l &Mi&±ffl^&MMMmMim£&&ga JHBttRizi 

mmtom&ttmmmMmMmt:ttftmftftij&ft& He ^, supra, 19980 

.A#2T$£fS*fc*nT. 5fc& 2X10 5 ^M^IfeTO6cm atji^^mL^, ^jefflM^a 
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mmmmwmm mm/m&t+% 100) . 48 ^hb, ^mumtf dna 

Wo 

i ^«.ft^g^!teffl» g i #»fe#*jst±w*a 40cm 

^^^E^W^^XlH]SlJ^W^« (JB Chadwick Proc. 
Natl. Acad. Sci. 97:2662-2667, 2000 tH) . ft&^E^Wyi/Htfttfj 
JftmBftJH'ffjMSB. &&ft 46.2 cm tfj PAX7, 4 46.2 cm #J PLA2, ft 
52.4 cm W E2F2, |Pft 71 cm ft MYCU ^ MSI lEteM ffifcb, W^I^&E 

WAMMnEPM 1 msi PH'tt^ hnpcc AAM. m$*£tfc&^ 1 

W SNP ^tHA&tt73i. fift^ 32.2 cm fflft£4ftf WIAF-481 A—f-^JEM 
£ft 53% (15 -H^ls 8/15) HNPCC IW+W LOH. 

&£f*&#ft snp •t»ttw»Rrwm»aa. %wm^&&&mm&ft£ 

&9tt. MSI BttWtf+ftWifrit. ffittiT. ft 22. 9 cm&|ftDlS450 

(1/11) |Pft32.4 cm£fctf]DlS228 (1/8) LOH #jtfc^Msf#£. 
fiftfr RIZ ft 1 j*%fttelgggftM»ftaEH ftflaMStf 1 " 1 pTel 32.2 cm&tfj 
Jij^SSiatf ^£##T. RIZ gE&^SSS 99 ^-ft 32.2 cm & 
(http://www.ncbi.nlm.nih.gov/genemap99) . £&#Al!fefe##flT$IHi 
RIZ & D1S228 370 kb & (Leygue ^, Supra, 1998) . RIZSSW3'|lJ 

5^^1^iMIf*S. ft HNPCC W»+ f A WIAF-481 $J RIZ 3* SI 
RIZ 5'^flRj LOH fflmWMLfr. X* RIZ iBrtH^tt^ fiJB0*F8+«« 
704 ft^rtfi^+M CA MJ*8I. |Pft4^hM?&tfJ 

SNP(ft 4 3' «t 18 -titSft GAT GAOaMrT^ffliMWr. 

RIZ SMUBtfj LOH «*^»J^ 21% (3/14), 9% (1/11), IP 0% (0/8). 5393T 
ttft 8 ^hfi? Pro-704 W LOH, flft^^'SW^A:. RIZ 
SE + £5fc»fJF&j6ffi-S. ft MSI BBttiB*»5fett l *Hftfe#«« LOH 

loh 30% (4/13), 23% 0/13), in 

60%(3/5)o 7T-0SU fltiffft RIZ S@ Pro-704 ±W LOH, H&gT 4 
WCA^W#SStt, X*g§£W«j£ftRlZ£Bfc». 4^ RIZ SHtt^ *H 
ft LOH tt/SWifi. IWft RIZ SB-+£mW »tS£, RIZ SSftl^ttlP 

n a 1 -5 36 ft*gft a ftgs . 

Riz SHKlWffWSaftK 8 ^JR^+W**$'W A8 in A9 £^ 

ft 37. 5% (9/24) ft MSI ffift HNPCC M^JIW A8 ( 1 ^) |fl A9 ( 8 

+ ) ^^ft/L^h msi Wttam&tt+W6>h (HCTH6, 

LoVo, LS411N, LS174T, MDAH2774, |P AN3CA) W A9 ^ISft® ft 
HCT116IP AN3CA*fflfl6*H>, ^tMft / «Sft> ASTW'Nfefett&tt 
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jSKftft MSI 

mmn%mm^mmm&vtm, muwn pms2*p dm^h^ as 

A8 £^ftft MSI fflttWitSIMW (P=0.0005) . 
ft MSI PHttflMS+X&fT RECQL, BLM, *P KIAA0355 S@ A9 if JlJ^^f. 4 24 
^ MSI.WttWJS'f. RW-^ft RECQL m KIAA0355 £E+W- > H**£5fe 

(p=o.oo5) o ±.&mmmw riz ii^t^j^^Miitit^ 
mmm* riz ga&ttTicTTBfe riz saww^s^ai. 

RIZl IP RIZ2(He ^, Supra, 1998) . ft£#-!g lp36 I^Wt, RIZ1 ^ 
ii^ffc, W RIZ2 3ti£^(He ^, Supra, 1998; Jiang ^, Int. J. 
Cancer 83:541-547, 1999; Jiang 5$, Histol. Histopathol. 15:109-117, 
2000)o ;«£ftW RIZ ^W^lf^.t RIZ ^ii^gilJM, HMM RT- 
PCR RlZl^ii (He ^, Supra, 1998; Jiang Supra, 1999; 

Jiang #, Supra, 2000) , RIZ2 £gp&3ft RIZl 4>> m&3cfe&&Wfe 
RIZ2 3gj£ 0 mnP>f&M RIZl |P RIZ1+RIZ2 ^ii7K¥o M;l§*fflflS$c 
MDAMB435S f£3l RIZ2 ff)*f$L M MDAMB21 f^^j^ii RIZl +RIZ2 WXtJS 

(He 3?, Supra, 1998) . 

11 ^ msi HttttftJMfttt'f>* 4 ^ rizi *j&MHt»«^M 

RIZ2 3ti£#ft, £^M»Jltt RIZl IP RIZ2 

(He^, Supra, 1998; Jiang M, Supra, 1999; Jiang *£, Supra, 2000) . 
ft 3 >h RIZl ^iiWMlftM*, HCT116, #ftW£I£M$?ltt^ft^ 
fti&ifffifl&'f, LS411N RIZ 2, L0V0 + RIZl SW48 * 

Rizi *ttMHK, fi&W RIZ *H£««#J^ft«o Riz gam*, 

w 8 Kb, RTifitfa^aia+ft riz siag-s^, sKSjaa^+wsa^. 

ft &£PHttM hnpcc W^^+ttllW RNA rizi ^jiW#*J$4k. 
iiRTISfefiil^ftttllW^W RNA Btgi!|jE*ffl«IW^»!. aUfitt+JUftttlft* 
BJ, Mft 90%fitl RIZ2 10%W RIZ1/RIZ2 RNA, RIZl . 

iiisaEgaa riz laiiMi s^ft riz ssg&K '-4700 A9 & 

ft->Ntt#5*c. jfiric-^tt^ RIZl WRIZ2§cettd>C-« -219 <i 
if jjnT 76 -fir^S^. RIZl m RIZ2 gatfcjE#4> 157 -t^M- 

**3e*$tt*fe###****a#tt*5 Riz ftW«Mfc*$W 
HCT116 jBft+afTftft«*»ftft»^li*5!*ift. &ff]*5fcflt£ft DLD1 *ffl 
fl&^&IESttfJ RIZl IP RIZ2 gfio ftWluff^6<]ffWIWaiJifi^4 3 » RIZl 
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3ti£*Mc» 'M&mM, m RIZ2 m^(He m, Supra, 1998; Buyse 
Supra, 1995; Liu ^, J. Biol. Chem. 272:2984-2991, 1997) o 
ft DLD1 ffflJ&^M'JIlJ 250 KD &} RIZ2, M&W#I'J RIZ1. *l&ft HCT116 *fflj& 
4>, M£-fcft RIZ2 *#ft, MfSI-t 230 KD ft RIZ2 q n q #, 
RIZ2 *<Mf-S. mfctt*lft#*iiEH RIZI *&, {IS RIZ2 

mm rizi ^wh*+w. 

rizi ^mMMmm^M^immMMmmmmmsmL mmmftw® 

RIZ1 *iSB«[iEifl4* MSI rctt6^Mftlffffi*t&ifij& G2/M JHJMJimW 
*n*fflfl&fflt:. M]«i^Tl»*i#^#W Rizi ^&ft HCT116 IP dldi &JKffi<D 

mm^Mttmo &tim&iwrm%& AdRizi cmoi=ioo) , ^mmwu 

W^Kft RIZI He^iic 48 /MtfJs. WNGflfe*^ G2/M $#J*fflj®i:BJiM 
ii^JPc ft AdRIZl ft&js 48 'MtffD 72 <Ntf, HCT116 SfflJWWfS^ 1 Gl $ DNA £ 

ftWaiflfi. M/TsW^SS^t:^ . {Ift DLDI &Jfe4 > &£g01!&ft4>. ^ 

J|iM RIZI ft HCT116 M^l^ G2/M SPIMt, Wft DLDI SBflfi+R 
3I&G2/M Pll. 

^- Table 1. RIZI ft^JM*ffl»fc JCT116 ft DLDI *^i£§l&£fflfl&/3Ji8 



Cell 
lines 


Apoptosi 
s 


Gl 


S 


G2/M 


HCT116 


25+1% 


-47+3% 


6. 1+7% 


20+8% 


DLDI 


9. 5+2% 


-28+1% 


2. 2+5% 


26+7% 



ft AdRIZl m AdNull ffiW$k 72 'b^tts, DNA 

«f- Ad RIZI -5 AdNull fc2fefflaifl&IS]^W#l£. 

AdRizi ft^jetb AdNuii nmnMrnsttiEmm*, 
ft msi rattw«+ , ft 3' Ntftii&$$i?tig c-« pr a^tt^ n-* 

PR EtfAMflSfM. ft MSI GMt-lf* (CIN fcft) , RIZI PR 

£5**friw#*. ft-«jtftt*ww»+. riz sss loh Mtt&gms 

gift, ^BJ-#ft-^RIZtS^WIt^^ , t I fe^. 

£*60|J2, « MSI WttttW*+<» RIZ . 

%^%mmmw riz ^i^i^ft^ft msi rattw*«+&£*MR 
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ftttai riz & msi PBttwjs+wf^ffi, mmm7-%wk®MffiA£± 
msi PR&ra&A, ^aa pcr id sscp wAM^^rfe*^ riz mm 
as ip A9 6«{fc 0 sun^MT-wj msi PHtt^jifi^, &mfgmm 

(HCT116, SW-48, LoVo, HCT8), t^JHI (DU145) , AMI (Cal-51), ? 
gftf (AN3CA, SK-UF18B). £1 MSI KttW SW620 ^XjflH. 

£ 33 MSI rott^I^^. Wl6^ (£48%) , £ 13 >h MSI PHttWaB 
flfi&^W 7 (£ 54, g + W HCT116, HCT8, LoVo, LS441N, LSL80, 
LSI74T, m AN3CA) W RIZ 3«. i&@&&£ttMgg. B2;£iESltt*S& 

l&7£ KSI9 A8 fc$*h M^M A9 ftjj&fc 

£ 21 #J RIZ (16 -f^-JlfP 7 ^NfflflSft) , 4 m (KS15, KS19, 

HCT116, AN3CA) ftM'Hfefeft^H*^. KS15 *P HCT116 Mtffi/^l*)^ 
KS19 &-^'%k&W±&] A8 ft A9 4«#W- > NttS5fc, ^-^&6#±£ 
A9 tt/S*— >M*S*5fe. M AN3CA £->Mfefi#± A9 £ 
-^lfefe#±« A9 ft^W-^**S5fe. 

MSI Wttl*4 , M#S*H&*5fe, MIS 16 *ff*MM«tt 
M+fc 14 >h, 4 17 16 >HS*t LOH &«J. Mlfp 

DNAo RIZ IS e&Me 8 ^S^FW 704 ttlii^H 
t«£ii/3c 704 ^)!Mitt£&£#tt, LOH. ftBft^«f 

6 >Mt*+»«W 2 #'R&»J LOH, KS07 ft KS20, -#£->h&£#J: 
-#ME3J4'MMI (KS07, KS15, KS19, KS20) ftW^Nfflflfi 
flc (HCT-116, AN3CA) W»fe# RIZ 5fe«. 

jfc«+tt*awjifw riz ttft9E$ira«i riz c-«i«f «. 

3 ft A9 76 *S6«fc> C-« 

219 £ A9 t + ^fc-'NttWtiEBJg 5 &W**ltOT5: £ A8 ft 

+S5fe- > h*li^^a4KJg 2 &W**±&i£, gg6&4> 293 >h«g 

Ho 

B*P RIZ1 ft C-^W PR RTIBfcJi RIZ1 tt^^jB^fli 

(Huang J. Biol. Chem. 273:15933-15440, 1998). Hffijfc C-Jgftlfltfc 
mRTfll^PS** RIZl Sbflfc. 

»iijws msi vmnm^Mmm riz uskw*. 54%mm 
m *w riz msi pg-tti^$ttt,w^o gLnnrnte riz ^wh^mm^* 

tt«ttfc$tf*H««£#fc-fl«l. riz 
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*m%. riz m®ftif& msi wtt^+ttft*. aaw^Tsaiaaaiw 

@#W»+. M*& RIZi ftife#lltfi£fcW*ift, EHtteWftT P53 

M> ATCC &£M HCT116 MlMIMtMiirWniflltt RIZI ft&IW££ P53 g 
Ho Delbecco iTOffj Eagle ffjbn 10«tttt^jL». 

^JMlKtMIt (Nu/Nu) £ Jackson (Bar Harbor, Maine) ft 

ft. 4 100 PSB *W 2X10 6 HCT116 ttMAifittM-' > MIK±. 6 %B 

Mmmb&m, *amea 0.5 cm. ttft4mM>**<hHttft&. 

4$*& 5-11 >K B-KM'AnMlMMAAtttf^A&lt PBS, MM** 

«i>SE9ftlAWSIHS#tt# AdNull (SffliCJfrJiflFftflr Prera Seth H 
sE^RIZl M AdRIZl, SRftfcp53 £6W-Adp53, Al#i*tt#Klftl#. 
Iflf. IP^SM^S 293 aiflfi±attf . JSL He ^, Cancer Res. 58:4328-4244, 
1998. 

ffflMI-RWfi*^, W##fcit#r&5Sfc aXb 2 X0.5, a 
fi. b fitS, ffl Student T »|«*tbtSi^Rim^ffl4«^WW 

£n® 2 #f^, £ PBS, AdNull, Adp53 ^fiWfR+M £&ffiit> 

ffl AdRIZl W»W<|NRW»4^*P93fi«a. 

ftff]*fllfeT«E3ta:WW*+ Rizi ft P 53 Ifitii-^MtW^Io 5fe 

*HCTiieiB»«ftffliiiii4>."ii:*4«:-^. nmm^&Mmm 0 a-tra 

&tt&ft%fi*tt£p53S&. RIZ«Jfc£DNAtffttt#iflS. 

RIZ ftftKkfiM 1637 ft#j|lLj|tff*# It 400 ft, ffl^ 1 p53 
AB545 trilfiLy*## 1: 200 ft. -&ft#*ii*Mfc4&ftiaM^ftft IgGo 
MJ*fflJMtTfrt, Sc«^J#ttffliftWffl Apotag J£ft&£MimftJ& 
(Intergen, Purchase, NY) «tf. flf^NW. *5*&JS, ffl^S 
§§&g, #^ TdT ft Avidin-aftik^SUK^JRlff DAB Jfefe. tJUJtffl 

Hematoxylin ixfr^fe. 

in® 3 **MHtit34 p53 RIZI £*toW*££l*»&ITO#»ja: 
gfflJI&&rtfti£ p53 sSc RIZI g&M. £ AdRIZl jWEfclttifaflt*! , PBS, 
AdNull, m Adp53 aitaWflttf+Wffi^matt Apotag -uEBJ3£ RIZI ft 

RIZg@^j£)£«£t3IMSI PHttWJ&RTWftflW^-K^^ISItiiailfiWt:. 
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^W^J^**itAM^-m^J*iS U.S. 08/516,859(1995^8^ 18 B) + 
SEQ ID NO. 1: 

(i) mmu: 

(A) 6171 MS*f 

(0 flS^: *X 

(ii) #ffi 

(A) **/^«ig s CDS 
(xi) SEQ ID NO. 1: 

GCCAAGATGG CGGCGGCGCG GCCGCGGGCG C CAGGGCG AC GGCGGCGGCT GAGGCTCTGG 
GCTCGCTGAA GCGTTGGCAC GTCGCGCTCT GGGCTCATGT AATCAAAGAA GTTTCTTTGT 



(b) mt 

(D) fliK, £tt 



(B) 



157-5275 



TGTGTGTATC TTCACAGAAC ACAACAGGAA TTGAAA ATG CAT CAG AAC ACT viAG 

Met His Gin Asn Thr Glu 
1 5 

TCT GTG GCA GCC ACT GAG ACT CTG GCT GAG GTA CCT GAA CAT GTG CTT 
Se- Val Ala Ala Thr Glu Thr Leu Ala Glu Val Pro Glu His Val Leu 
10 15 20 

CGA GGA CTT CCA GAG GAA GTA AGA CTT TTC CCA TCT GCA GTC GAC AAG 
Arg Gly Leu Pro Glu Glu Val Arg Leu Phe Pro Ser Ala Val Asp Lys 
25 3 0 
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ACT CGG ATT GGT GTC TGG GCT ACT AAA CCA ATT TTA AAA GGG AAA AAG 
Thr Arg He Gly Val Trp Ala Thr Lys Pro He Leu Lys Gly Lys Lys 
40 45 50 



TTT GGG CCA TTT GTT GGT GAT AAG AAG AAG AGA TCC CAG GTT AGG AAT 
Phe Gly Pro Phe Val Gly Asp Lys Lys Lys Arg Ser Gin Val Arg Asn 
55 60 65 70 

AAT GTG TAC ATG TGG GAG GTC TAC TAC CCA AAT TTG GGG TGG ATG TGC 
Asn Val Tyr Met Trp Glu Val Tyr Tyr Pro Asn Leu Gly Trp Met Cys 
75 -80 B5 

ATT GAT GCC ACC GAT CCG GAG AAG GGC AAC TGG CTA CGC TAT GTG AAC 
lie Asp Ala Thr Asp Pro Glu Lys Gly Asn Trp Leu Arg Tyr Val Asn 
* 90 95 100 

TGG GCT TGC TCA GGA GAA GAG CAG AAT TTA TTT CCA CTG GAA ATC AAC 
Trp Ala Cys Ser Gly Glu Glu Gin Asn Leu Phe Pro Leu Glu He Asn 
105 HO H5 

AGA GCC ATT TAC TAT AAA ACC TTA AAG CCA ATC GCG CCT GGC GAG GAG 
Arg Ala He Tyr Tyr Lys Thr Leu Lys Pro He Ala Pro Gly Glu Glu 
120 125 130 

CTC CTG GTC TGG TAC AAT GGG GAA GAC AAC CCT GAG ATA GCA GCT GCG 
Leu Leu Val Trp Tyr Asn Gly Glu Asp Asn Pro Glu He Ala Ala Ala 
135 140 145 150 

ATT GAG GAA GAG CGA GCC AGC GCC CGG AGC AAG CGG AGC TCC CCG AAG 
He Glu Glu Glu Arg Ala Ser Ala Arg Ser Lys Arg Ser Ser Pro Lys 
155 160 16S 

AGC CGC AGA GGG AAG AAG AAA TCA CAC GAG AAC AAA AAC AAA GGC ATC 
Ser Arg Arg Gly Lys Lvs Lvs Ser His Glu Asn Lys Asn Lys Gly II*? 
170 17S 180 



60 
120 
174 

222 

270 

318 

366 

414 

462 

510 

558 

606 

654 

702 
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AGA ACC CAC CCC ACA CAG CTG AAG GCA AGT GAG CTG GAC TCT ACC TTT 7 50 

Arg Thr His Pro Thr Gin Leu Lys Ala Ser Glu Leu Asp Ser Thr Phe 
185 190 195 

GCA AAC ATG AGG GGC TCT GCA GAA GGT CCA AAA GAA GAG GAT GAG AGG 798 
Ala Asn Met Arg Gly Ser Ala Glu Gly Pro Lys Glu Glu Asp Glu Arg 
200 J 205 210 

CCT TTG GCT TCG GCA CCT GAG CAG CCA GCC CCT CTG CCG GAG GTG GGG 846 
Pro Leu Ala Ser Ala Pro Glu Gin Pro Ala Pro Leu Pro Glu Val Gly 
215 220 225 230 

AAT CAA GAT GCA GTT CCA CAG GTG GCC ATC CCT CTC CCT GCC TGC GAG 8 94 

Asn Gin Asp Ala Val Pro Gin Val Ala He Pro Leu Pro Ala Cys Glu 
235 240 245 

CCA CAG CCA GAG GTA GAT GGG AAA CAA GAA GTC ACA GAC TGT GAG GTC 942 
Pro Gin Pro Glu Val Asp Gly Lys Gin Glu Val Thr Asp Cys Glu Val 
250 255 260 

AAT GAT GTG GAG GAA GAG GAG CTG GAA GAG GAA GAG GAG CTG GAA GAG 990 
Asn Asp Val Glu Glu Glu Glu Leu Glu Glu Glu Glu Glu Leu Glu Glu 
265 270 275 

GAG GAG GAG GAG GAG TTG GGA GAA GAT GGG GTA GAA GAA GCA GAC ATG 1038 
Glu Glu Glu Glu Glu Leu Gly Glu Asp Gly Val Glu Glu Ala Asp Met 

CCG AAT GAA AGC TCT GCG AAA GAG CCG GAG ATC CGG TGT GAA GAA AAG 1086 
Pro Asn Glu Ser Ser Ala Lys Glu Pro Glu He Arg Cys Glu Glu Lys 
295 300 305 310 

CCA GAA GAC TTA TTA GAA GAG CCA CAG AGC ATG TCG AAT GAA GCT CGT 1134 
Pro Glu Asp Leu Leu Glu Glu Pro Gin Ser Met Ser Asn Glu Ala Arg 
315 320 325 

GAA GAC TCT CCA GAC GTG ACC CCT CCT CCC CAC ACT CCC AGA GCT AGA 1182 
Glu Asp Ser Pro Asp Val Thr Pro Pro Pro His Thr Pro Arg Ala Arg 
330 335 340 

GAG GAG GCC AAC GGT GAT GTA CTT GAG ACA TTT ATG TTT CCG TGT CAG 1230 
Glu Glu Ala Asn Gly Asp Val Leu Glu Thr Phe Met Phe Pro Cys Gin 
345 * 350 355 

CAC TGT GAA' AGA AAA TTT GCA ACG AAG CAG GGG CTA GAG CGT CAC ATG 1278 
His Cys Glu Arg Lys Phe Ala Thr Lys Gin Gly Leu Glu Arg His Met 
360 365 370 

CAC ATC CAC ATT TCT ACC ATC AAT CAT GCT TTC AAG TGC AAG TAC TGT 1326 
His He His He Ser Thr He Asn His Ala Phe Lys Cys Lys Tyr Cys 
375 380 385 390 

GGG AAA CGG TTT GGC ACA CAG ATT AAC AGG AGG CGG CAT GAA CGG CGC 1374 
Gly Lys Arg Phe Gly Thr Gin He Asn Arg Arg Arg His Glu Arg Arg 
395 400 405 

CAC GAA ACG GGG TTG AAG AGA AGA CCC AGC ATG ACT TTA CAG TCC TCA 1422 
His Glu Thr Gly Leu Lys Arg Arg Pro Ser Met Thr Leu Gin Ser Ser 
410 415 420 

GAG GAT CCA GAT GAT GGC AAG GGG GAA AAT GTT ACT TCT AAA GAT GAG 14 70 

Glu Asp Pro Asp Asp Gly Lys Gly Glu Asn Val Thr Ser Lys Asp Glu 
425 430 435 

TCA AGT CCA CCT CAA CTC GGG CAA GAC TGT TTG ATA TTG AAC TCA GAG 1518 
Ser Ser Pro Pro Gin Leu Gly Gin Asp Cys Leu He Leu Asn Ser Glu 
440 445 450 



IB 



/ 



AAA ACC TCA CAG GAA GTA CTG AAT TCA TCT TTT GTG GAA GAA AAT GGT 1566 
Lys Thr Ser Gin Glu Val Leu Asn Ser Ser Phe Val Glu Glu Asn Gly 
455 460 465 470 

GAA GTT AAA GAA CTT CAT CCG TGC AAA TAC TGC AAA AAG GTA TTT GGA 1614 
Glu Val Lys Glu Leu His Pro Cys Lys Tyr Cys Lys Lys Val Phe Gly 
475 480 485 

ACT CAC ACC AAT ATG AGA CGA CAT CAG CGT AGA GTT CAT GAG CGC CAC 1662 
Thr His Thr Asn Met Arg Arg His Gin Arg Arg Val His Glu Arg His 
490 495 500 

CTG ATT CCC AAA GGT GTC AGG CGA AAA GGA GGA CTT CTG GAA GAG CCA 1710 
Leu lie Pro Lys Gly Val Arg Arg Lys Gly Gly Leu Leu Glu Glu Pro 
505 510 515 

CAG CCA CCA GGA GAG CAG GCT CCA CCC TCC CAG AAT GTC TAT GTC CCA 17 58 

Gin Pro Pro Ala Glu Gin Ala Pro Pro Ser Gin Asn Val Tyr Val Pro 
520 525 530 

AGC ACA GAG CCA GAG GAG GAA GGG GAA AGA GAT GAC GTG TAC ATC ATG 1806 
Ser Thr Glu Pro Glu Glu Glu Gly Glu Thr Asp Asp Val Tyr lie Met 
535 540 545 550 

GAC ATT TCT AGC AAC ATC TCT GAA AAC CTA AAT TAC TAT ATT GAC GGT 1854 
Asp He Ser Ser Asn He Ser Glu Asn Leu Asn Tyr Tyr He Asp Gly 
555 560 565 

AAG ATT CAG ACC AAC AGC AGC ACT AGT AAC TGT GAT GTG ATT GAG ATG 1902 
Lys He Gin Thr Asn Ser Ser Thr Ser Asn Cys Asp Val He Glu Met 
570 575 580 

GAG TCT AAT TCT GCA CAC TTG TAT GGC ATA GAC TGT CTG CTC ACT CCA 1950 
Glu Ser Asn Ser Ala His Leu Tyr Gly He Asp Cys Leu Leu Thr Pro 
585 590 595 

GTG ACC GTG GAG ATT ACT CAG AAC ATA AAG AGC ACT CAG GTC TCT GTG 1998 
Val Thr Val Glu He Thr Gin Asn He Lys Ser Thr Gin Val Ser Val 
600 605 610 

ACA GAT GAT CTT CTC AAA GAC TCT CCC AGC AGC ACA AAT TGT GAG TCT 204 6 

Thr Asp Asp Leu Leu Lys Asp Ser Pro Ser Ser Thr Asn Cys Glu Ser 
615 620 . 625 630 

AAG AAA CGG AGG ACT GCC AGT CCA CCT GTG CTC CCC AAA ATT AAA ACG 2094 
Lys Lys Arg Arg Thr Ala Ser Pro Pro Val Leu Pro Lys He Lys Thr 
635 640 645 

GAG ACG GAG TCT GAT TCC ACA GCA CCC TCG TGT TCC TTA AGT CTG CCC 214 2 

Glu Thr Glu Ser Asp Ser Thr Ala Pro Ser Cys Ser Leu Ser Leu Pro 
650 655 660 

CTG AGC ATA TCC ACA GCC GAG GTG GTG TCC TTC CAT AAA GAG AAG GGC 2190 
Leu Ser He Ser Thr Ala Glu Val Val Ser Phe His Lys Glu Lys Gly 
665 670 675 

GTC TAT TTG TCG TCC AAG CTC AAG CAG CTT CTT CAG ACC CAG GAC AAG 2238 
Val Tyr Leu Ser Ser Lys Leu Lys Gin Leu Leu Gin Thr Gin Asp Lys 
680 685 690 

CTG ACC CTT CCT GCA GGG TTT TCA GCA GCT GAG ATT CCT AAG TTA GGT 2286 
Leu Thr Leu Pro Ala Gly Phe Ser Ala Ala Glu He Pro Lys Leu Gly 
695 700 705 710 

CCC GTG TGC GCG TCT GCT CCT GCA TCC ATG TTG CCC GTG ACC TCT AGT 2 3 34 

Pro val Cys Ala Ser Ala Pro Ala Ser Met Leu Pro Val Thr Ser Ser 
715 720 725 



\ 



AGG TTT AAG AGA CGC ACC AGC TCT CCA CCG AGC TCT CCA CAG CAC AGC 2 382 

Arg Phe Lys Arg Arg Thr Ser Ser Pro Pro Ser Ser Pro Gin His Ser 
730 735 740 

CCT GCC CTT CGA GAC TTC GGG AAA CCA AAT GAT GGG AAA GCA GCA TGG 24 30 

Pro Ala Leu Arg Asp Phe Gly Lys Pro Asn Asp Gly Lys Ala Ala Trp 
745 750 755 

ACA GAC ACA GTC CTG ACT TCC AAG AAA CCC AAG TTA GAA AGT CGT AGT 24 78 

Thr Asp Thr Val Leu Thr Ser Lys Lys Pro Lys Leu Glu Ser Arg Ser 
760 765 770 

GAC TCA CCA GCA TGG AGT TTG TCT GGG AGA GAT GAA AGA GAA ACC GGA 2 526 

Asp Ser Pro Ala Trp Ser Leu Ser Gly Arg Asp Glu Arg Glu Thr Gly 
775 780 785 790 

AGC CCT CCT TGC TTT GAT GAA TAC AAA ATA TCA AAG GAA TGG GCA GCC 2 574 

Ser Pro Pro Cys Phe Asp Glu Tyr Lys He Ser Lys Glu Trp Ala Ala 
795 800 805 

AGT TCT ACT TTC AGC AGT GTG TGC AAC CAA CAG CCA TTG GAT TTA TCC 2622 
Ser Ser Thr Phe Ser Ser Val Cys Asn Gin Gin Pro Leu Asp Leu Ser 
810 815 820 

AGC GGG GTC AAA CAG AAG TCA GAG GGC ACA GGC AAG ACT CCA GTC CCA 2670 
Ser Gly Val Lys Gin Lys Ser Glu Gly Thr Gly Lys Thr Pro Val Pro 
825 830 835 

TGG GAA TCT GTA TTG GAT CTC AGT GTG CAT AAA AAG CCT TGC GAT TCT 2718 
Trp Glu Ser Val Leu Asp Leu Ser Val His Lys Lys Pro Cys Asp Ser 
840 845 850 

GAA GGC AAG GAA TTC AAA GAG AAC CAT TTG GCA CAG CCA GCT GCA AAG 2766 
Glu Gly Lys Glu Phe Lys Glu Asn His Leu Ala Gin Pro Ala Ala Lys 
855 860 865 870 

AAG AAA AAA CCA ACC ACC TGT ATG CTT CAA AAG GTT CTT CTC AAT GAG 2814 
Lys Lys Lys Pro Thr Thr Cys Met Leu Gin Lys Val Leu Leu Asn Glu 
7 875 * 880 885 

TAT AAT GGT GTT AGC TTA CCT ACA GAA ACC ACA CCA GAG GTG ACC AGG 2862 
Tyr Asn Gly Val Ser Leu Pro Thr Glu Thr Thr Pro Glu Val Thr Arg 
890 895 . 900 

AGC CCA AGT* CCT TGT AAA TCC CCA GAT ACA CAG CCA GAT CCT GAA CTT 2910 
Ser Pro Ser Pro Cys Lys Ser Pro Asp Thr Gin Pro Asp Pro Glu Leu 
905 910 915 

GGT CCT GAC TCA AGT TGC TCA GTC CCC ACT GCT GAG TCT CCA CCT GAA 2958 
Gly Pro Asp Ser Ser Cys Ser Val Pro Thr Ala Glu Ser Pro Pro Glu 
920 925 930 

GTT GTT GGC CCT TCC TCA CCC CCT CTC CAG ACA GCC TCC TTA TCC TCC 3006 
Val Val Gly Pro Ser Ser Pro Pro Leu Gin Thr Ala Ser Leu Ser Ser 
935 940 945 950 

GGT CAG CTG CCT CCT CTC TTA ACC CCC ACA GAG CCT TCT TCC CCT CCC 3054 
Gly Gin Leu Pro Pro Leu Leu Thr Pro Thr Glu Pro Ser Ser Pro Pro 
955 960 965 

CCC TGC CCT CCT GTG TTA ACT GTT GCC ACT CCA CCA CCT CCC CTC CTT 3102 
Pro Cys Pro Pro Val Leu Thr Val Ala Thr Pro Pro Pro Pro Leu Leu 
970 975 980 

CCA ACC GTC CCT CTC TCC CAC CCC TCT TCT GAT GCC TCC CCT CAG CAG 3150 
Pro Thr Val Pro Leu Ser His Pro Ser Ser Asp Ala Ser Pro Gin Gin 
985 990 995 



TGT CCC TCT CCC TTC TCA AAC ACC ACT GCT CAG TCT CCT CTT CCC ATT 3198 
Cys Pro Ser Pro Phe Ser Asn Thr Thr Ala Gin Ser Pro Leu Pro He 
1000 1005 1010 

CTC TCC CCA ACA GTG TCT CCC TCT CCC TCT CCC ATT CCT CCT GTA GAG 3246 
Leu Ser Pro Thr Val Ser Pro Ser Pro Ser Pro He Pro Pro Val Glu 
1015 1020 1025 1030 

CCA CTT ATG TCT GCT GCT TCC CCT GGT CCC CCA ACA CTT TCT TCC TCC 3 294 

Pro Leu Met Ser Ala Ala Ser Pro Gly Pro Pro Thr Leu Ser Ser Ser 
1035 1040 1045 

TCC TCT TCT TCC TCT TCC TTC CCT TCC TCT TCC TGC TCC TCC ACC TCC 3342 
Ser Ser Ser Ser Ser Ser Phe Pro Ser Ser Ser Cys Ser Ser Thr Ser 
1050 105S 1060 

CCC TCC CCA CCC CCT CTT TCA GCA GTG TCA TCT GTG GTT TCC TCT GGG 3390 
Pro Ser Pro Pro Pro Leu Ser Ala Val Ser Ser Val Val Ser Ser Gly 
1065 1070 1075 

GAC AAC CTG GAG GCA TCT CTG CCT GCA GTA ACT TTC AAA CAG GAG GAG 34 38 

Asp Asn Leu Glu Ala Ser Leu Pro Ala Val Thr Phe Lys Gin Glu Glu 
1080 1085 1090 

TCA GAG AGT GAA GGT CTG AAA CCC AAG GAA GAG GCC CCA CCT GCA GGG 34 86 

Ser Glu Ser Glu Gly Leu Lys Pro Lys Glu Glu Ala Pro Pro Ala Gly 
1095 HOO H05 1H0 

GGA CAG AGT GTG GTC CAA GAA ACA TTC AGC AAA AAC TTC ATT TGC AAT 3 534 

Glv Gin Ser Val Val Gin Glu Thr Phe Ser Lys Asn Phe He Cys Asn 
y 1115 1120 H25 

GTC TGT GAA TCG CCT TTT CTT TCC ATT AAA GAC CTA ACC AAA CAT TTA 3 582 

Val Cys Glu Ser Pro Phe Leu Ser He Lys Asp Leu Thr Lys His Leu 
1130 H35 11^0 

TCC GTC CAT GCT GAA GAG TGG CCC TTC AAA TGT GAG TTT TGT GTG CAG 3630 
Ser Val His Ala Glu Glu Trp Pro Phe Lys Cys Glu Phe Cys Val Gin 
1145 H50 1155 

CTG TTT AAG GTT AAG ACT GAT CTA TCA GAG CAT CGA TTT CTG CTT CAT 3676 
Leu Phe Lys Val Lys Thr Asp Leu Ser Glu His Arg Phe Leu Leu His 
1160 * H65 H70 

GGG GTT GGA *AT ATC TTT GTG TGT TCT GTA TGT AAG AAA GAA TTT GCC 3726 
Gly Val Gly Asn He Phe Val Cys Ser Val Cys Lys Lys Glu Phe Ala 
1175 H80 H85 113° 

TTC TTA TGC AAT CTG CAG CAG CAC CAG CGT GAT CTC CAC CCA GAT GAG 3774 
Phe Leu Cys Asn Leu Gin Gin His Gin Arg Asp Leu His Pro Asp Glu 
H95 1200 120S 

GTA TGC ACA CAC CAC GAG TTT GAA AGT GGG ACC CTG AGG CCC CAG AAC 3822 
Val Cys Thr His His Glu Phe Glu Ser Gly Thr Leu Arg Pro Gin Asn 
1210 1215 1220 

TTC ACA GAC CCC AGC AAG GCC AAT GTT GAG CAT ATG CCA AGT TTG CCA 38 70 

Phe Thr Asp Pro Ser Lys Ala Asn Val Glu His Met Pro Ser Leu Pro 
1225 1230 1235 

GAA GAG CCT TTA GAA ACT TCT AGA GAG GAG GAG TTA AAT GAT TCC TCT 3918 
Glu Glu Pro Leu Glu Thr Ser Arg Glu Glu Glu Leu Asn Asp Ser Ser 
1240 1245 12S0 

GAA GAG CTT TAC ACG ACC ATC AAA ATA ATG GCT TCT GGA ATA AAG ACG 3 966 

Glu Glu Leu Tyr Thr Thr lie Lys He Met Ala Ser Gly He Lys Thr 
1255 1260 1265 1270 



AAG GAT CCA GAT GTT CGA CTT GGT CTC AAC CAG CAC TAC CCG AGC TTT 4014 
Lys Asp Pro Asp Val Arg Leu Gly Leu Asn Gin His Tyr Pro Ser Phe 
1275 1280 I 285 

AAA CCT CCT CCA TTT CAG TAC CAC CAT CGA AAC CCT ATG GGG ATA GGG 4062 
Lys Pro Pro Pro Phe Gin Tyr His His Arg Asn Pro Met Gly He Gly 
1290 1295 1300 

GTG ACA GCC ACC AAC TTC ACC ACC CAC AAT ATT CCA CAG ACT TTC ACT 4110 
Val Thr Ala Thr Asn Phe Thr Thr His Asn He Pro Gin Thr Phe Thr 
1305 . 1310 1315 

ACT GCC ATC CGC TGC ACA AAG TGT GGG AAG GGC GTC GAC AAT ATG CCT 4158 
Thr Ala He Arg Cys Thr Lys Cys Gly Lys Gly Val Asp Asn Met Pro 
1320 ~ 1325 1330 

GAG CTG CAT AAG CAT ATC TTG GCG TGT GCG TCT GCA AGT GAC AAG AAG 4206 
Glu Leu His Lys His He Leu Ala Cys Ala Ser Ala Ser Asp Lys Lys 
1335 1340 1345 1350 

AGG TAC ACC CCT AAG AAA AAC CCA GTG CCC CTG AAA CAA ACT GTG CAG 4 254 

Arq Tyr Thr Pro Lys Lys Asn Pro Val Pro Leu Lys Gin Thr Val Gin 
1355 1360 1365 

CCC AAA AAT GGA GTG GTG GTT CTA GAC AAC TCT GGG AAA AAT GCC TTC 4 302 

Pro Lys Asn Gly Val Val Val Leu Asp Asn Ser Gly Lys Asn Ala Phe 
1370 1375 1380 

AGA CGG ATG GGG CAG CCC AAG AGA CTG AGC TTC AAT GTT GAA CTG GGT 4 3 50 

Arg Arg Met Gly Gin Pro Lys Arg Leu Ser Phe Asn Val Glu Leu Gly 
1385 1390 1395 

AAA ATG TCT CCA AAC AAG CTC AAG CTG AGT GCG CTG AAG AAG AAA AAC 4 3 98 

Lys Met Ser Pro Asn Lys Leu Lys Leu Ser Ala Leu Lys Lys Lys Asn 
1400 1405 1410 

CAG CTG GTG CAG AAG GCC ATC CTT CAG AAG AAC AGA GCC GCG AAG CAG 44 46 

Gin Leu Val Gin Lys Ala He Leu Gin Lys Asn Arg Ala Ala Lys Gin . 
1415 1420 1425 1430 

AAG GCG GAC CTG AGG GAT ACT- TCC GAG GCG TCC TCA CAC ATC TGC CCG 4494 
Lys Ala Asp Leu Arg Asp Thr Ser Glu Ala Ser Ser His lie Cys Pro 
1 1435 1440 1445 

TAC TGT GAC AGG GAG TTC ACA TAC ATT GGC AGC CTG AAT AAG CAT GCC 4542 
Tyr Cys Asp Arg Glu Phe Thr Tyr He Gly Ser Leu Asn Lys His Ala 
14S0 1455 1460 

GCC TTC AGC TGT CCT AAA AAA CCT CTT TCT CCT TCC AAA AGA AAA GTT 4590 
Ala Phe Ser Cys Pro Lys Lys Pro Leu Ser Pro Ser Lys Arg Lys Val 
1465 1470 1475 

TCC CAT TCG TCT AAG AAA GGT GGC CAT GCA TCA TCT TCT AGC AGT GAC 
Ser His Ser Ser Lys Lys Gly Gly His Ala Ser Ser Ser Ser Ser Asp 
1480 1485 1490 

AGA AAC AGC AGC TGC CAC CCC CGG AGG CGG ACT GCA GAT ACC GAG ATC 
Arg Asn Ser Ser Cys His Pro Arg Arg Arg Thr Ala Asp Thr Glu He 
149S 1500 1505 1510 

AAG ATG CAG AGC ACG CAG GCA CCC TTG GGC AAG ACC AGA GCT CGG AGT 
Lys Met Gin Ser Thr Gin Ala Pro Leu Gly Lys Thr Arg Ala Arg Ser 
1S15 1520 I 525 

ACA GGC CCC GCC CAG GCC TCA CTG CCC TCC TCG TCC TTC AGA TCC AGA 
Thr Gly Pro Ala Gin Ala Ser Leu Pro Ser Ser Ser Phe Arg Ser Arg 
1530 1535 1540 



4638 



4686 



4734 



4782 
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CAG AAT GTC AAA TTT GCA GCT TCA GTG AAA TCC AAA AAA GCA AGC TCT 4830 
Gin Asn val Lys Phe Ala Ala Ser Val Lys Ser Lys Lys Ala Ser Ser 
1545 1550 1555 

TCA TCC TTG AGG AAT TCC AGT CCC ATA AGA ATG GCC AAA ATT ACT CAC 4 878 

Ser Ser Leu Arg Asn Ser Ser Pro He Arg Met Ala Lys He Thr His 
1560 1565 1570 

GTC GAG GGC AAA AAA CCC AAA GCT GTT GCC AAG AGT CAT TCT GCT CAG 4 926 

Val Glu Gly Lys Lys Pro Lys Ala Val Ala Lys Ser His Ser Ala Gin 
1575 1580 1585 1590 

CTC TCA AGC AAA TCC TCC CGA GGC CTG CAT GTC AGA GTG CAG AAG AGC 4 974 

Leu Ser Ser Lys Ser Ser Arg Gly Leu His Val Arg Val Gin Lys Ser 
1595 1600 1605 

AAA GCT GTC ATA CAG AGC AAG ACT GCC CTG GCC AGT AAG AGG AGA ACA 5022 
Lys Ala Val He Gin Ser Lys Thr Ala Leu Ala Ser Lys Arg Arg Thr 
1610 1615 1620 

GAC CGG TTC ATA GTG AAA TCT AGA GAG CGC AGC GGG GGC CCA ATC ACC 5070 
Asp Arg Phe He Val Lys Ser Arg Glu Arg Ser Gly Gly Pro He Thr 
1625 1630 1635 

CGA AGC CTT CAG CTG GCA GCT GCT GCG GAC CTG AGT GAA AGC AGG AGA 5118 
Arg Ser Leu Gin Leu Ala Ala Ala Ala Asp Leu Ser Glu Ser Arg Arg 
1640 1645 1650 

GAG GAC AGC AGT GCC AGG CAT GAG CTG AAG GAC TTC AGC TAC AGT CTC 5166 
Glu Asp Ser Ser Ala Arg His Glu Leu Lys Asp Phe Ser Tyr Ser Leu 
1655 1660 1665 1670 

CGC CTG GCA TCT CGA TGC GGC TCA TCA ACA GCC TCT TAC ATC ACC AGA 5214 
Arg Leu Ala Ser Arg Cys Gly Ser Ser Thr Ala Ser Tyr He Thr Arg 
1675 1680 1685 

CAA TGC AGA AAG GTC AAG GCC GCC GCA GCA ACT CCG TTC CAG GGA CCC 5262 
Gin Cys Arg Lys Val Lys Ala Ala Ala Ala Thr Pro Phe Gin Gly Pro 
1690 1695 1700 

TTC CTC AAA GAG T AGGCACTCTG TCTGCTCCTT AACAGCACCT GAAGTGACCT 5315 
Phe Leu Lys Glu 
1705 

GGAATCAGTG AAGCCAAAGG GACCAGCAGT CTGCCCTGCA G AGAG CACTG AC CTC TCC CA 5375 

GTTGTGAGAG TGAGAGAACG AGAGAGAGAG AGAGAGAGAG AGAGAGAGAG AGAGAGAGAG 543 5 

AGAATGAGAA TGTGTGTGTG TGTGTGCTGG TGCATGTGTG TGGTCTTCAA GCCAAGGTCC 54 95 

CAGCCTCAGG AGCAGGACCT TCCCATTTCC CGTCATCCTC TGGATGATCC TTGGACGTGG 5555 

CCCAGAACCG TGCTCTGTGG TG CAG C CATC CTGCCCGGGA GGGGCATCTC CTTCTATGCA 5615 

ATTTTTTTAA AGAG TTC CTT GGCCCTGCTT TGTGCTTCTT GAGCTGTCCG TTTGCCACCA 567 5 

CTGGGACTTG GATCTGGCCC TGAGGGGTGG GGAAGAGGGC CTATCTAAGG ATAACCTTTC 573 5 

AGAGGTCAAG CTCCCCTTCA TGCCACCCCT CCCCCCTGCC CTCACCGACC TTTTCCCCAC 5795 

ACTGTCTCTG GGAATCAATA GCAGATAGCA TATAGATCCA TCAGGGTTGA GCCTGAACCT 58 55 

CGGCCCTAGC ACTAGGAAAT CCCC CTTTT C TCCCTAAGCA ACTGGAGCCG CCAGCTTTCA S915 

AGTCATTTCC TCCTTTGAGG TTCTAGAGTC CGAGAGTCTG CTCCGAAGTC TCTCCTGGGA 5975 

ACCCGGGAGC CCTCGCACCC AGGACGCAGA CTCTGTGCCC ATTCTTAGAC CTGAGGTAGA 6035 



6171 



AGAAGCAGTG TTTTCGGACG ATAGGGTGGA GGCGTGCCTA CTTTGTCTCC TCTGGTGGGA 609S 
CCTCCTACAT CATTGGCATC TGAACCTTGC AAGTTCGCTG CAAAGAGAAG CAAAGGAAAA 615S 
AAAAAAAAAA AAAAAA 

SEQ ID NO. 2: 
(i) 

(a) ^S: 1076 n.mm (b) as® 

•(D) gttt 

(ii) saM 

Met His Gin Asn Thr Glu Ser Val Ala Ala Thr Glu Thr Leu Ala Glu 
1 5 . 10 15 

Val Pro Glu His Val Leu Arg Gly Leu Pro Glu Glu Val Arg Leu Phe 
20 25 30 

Pro Ser Ala Val Asp Lys Thr Arg He Gly Val Trp Ala Thr Lys Pro 
35 40 45 

He Leu Lys Gly Lys Lys Phe Gly Pro Phe Val Gly Asp Lys Lys Lys 
50 ' SS 60 

Arg Ser Gin Val Arg Asn Asn Val Tyr Met Trp Glu Val Tyr Tyr Pro 
65 70 '5 80 

Asn Leu Gly Trp Met Cys lie Asp Ala Thr Asp Pro Glu Lys Gly Asn 
85 90 " 

Trp Leu Arg Tyr Val Asn Trp Ala Cys Ser Gly Glu Glu Gin Asn Leu 
" 100 ios "° 

Phe Pro Leu Glu He Asn Arg Ala He Tyr Tyr Lys Thr Leu Lys Pro 
115 120 125 

He Ala Pro Gly Glu Glu Leu Leu Val Trp Tyr Asn Gly Glu Asp Asn 
130 135 KO 

Pro Glu He Ala Ala Ala He Glu Glu Glu Arg Ala Ser Ala Arg Ser 
14S 150 155 160 

Lys Arg Ser Ser Pro Lys Ser Arg Arg Gly Lys Lys Lys Ser His Glu 
165 I 70 l' S 

Asn Lys Asn- Lys Gly He Arg Thr His Pro Thr Gin Leu Lys Ala Ser 
180 I 85 lsu 

Glu Leu Asp Ser Thr Phe Ala Asn Met Arg Gly Ser Ala Glu Gly Pro 
l9S 200 205 

Lya Glu Glu Asp Glu Arg Pro Leu Ala Ser Ala Pro Glu Gin Pro Ala 
210 HS 220 

Pro Leu Pro Glu Val Gly Asn Gin Asp Ala Val Pro Gin Val Ala lie 
225 230 23S " 

Pro Leu Pro Ala Cys Glu Pro Gin Pro Glu Val Asp Gly Lys Gin Glu 
245 250 2 " 



Val Thr Asp Cys Glu Val Asn Asp Val Glu Glu Glu Glu Leu Glu Glu 

265 * w 



260 



Glu Glu Glu Leu Glu Glu Glu Glu Glu Glu Glu Leu Gly Glu Asp Gly 
275 280 285 

Val Glu Glu Ala Asp Met Pro Asn Glu Ser Ser Ala Lys Glu Pro Glu 
290 295 300 

He Arg Cys Glu Glu Lys Pro Glu Asp Leu Leu Glu Glu Pro Gin Ser 



305 



310 



315 



Met Ser Asn Glu Ala Arg Glu. Asp Ser Pro Asp Val Thr Pro Pro Pro 
325 330 335 

His Thr Pro Arg Ala Arg Glu Glu Ala Asn Gly Asp Val Leu Glu Thr 
340 345 350 

Phe Met Phe Pro Cys Gin His Cys Glu Arg Lys Phe Ala Thr Lys Gin 
355 360 365 

Gly Leu Glu Arg His Met His He Hie lie Ser Thr lie Asn His Ala 
370 375 380 

Phe Lys Cys Lys Tyr Cys Gly Lys Arg Phe Gly Thr Gin lie Asn Arg 

Arg Arg His Glu Arg Arg His Glu Thr Gly Leu Lys Arg Arg Pro Ser 
4 Q5 410 

Met Thr Leu Gin Ser Ser Glu Asp Pro Asp Asp Gly Lys Gly Glu Asn 

425 430 



420 



Val Thr Ser Lys Asp Glu Ser Ser Pro Pro Gin Leu Gly Gin Asp Cys 
435 440 445. 

Leu lie Leu Asn Ser Glu Lys Thr Ser Gin Glu Val Leu Asn Ser Ser 



450 



455 



460 



Phe Val Glu Glu Asn Gly Glu Val Lys Glu Leu His Pro Cys Lys Tyr 

475 



465 



470 



Cys Lys Lys Val Phe Gly Thr His Thr Asn Met Arg Arg His Gin Arg 
485 49<} 9 



Arg Val His <5lu Arg His Leu lie Pro Lys Gly Val Arg Arg Lys Gly 
500 505 5 



Glv Leu Leu Glu Glu Pro 

515 520 



Gin Pro Pro Ala Glu Gin Ala Pro Pro Ser 

525 



Gin Asn Val Tyr Val Pro Ser Thr Glu Pro Glu Glu Glu Gly Glu Thr 
530 535 540 

Asp Asp Val Tyr He Met Asp lie Ser Ser Asn He Ser Glu Asn Leu 
545 550 555 

Asn Tyr Tyr He Asp Gly Lys He Gin Thr Asn Ser Ser Thr Ser Asn 



56S 



570 



Cvs Asp Val He Glu Met Glu Ser Asn Ser Ala His Leu Tyr Gly He 
7 580 585 590 

Asp Cys Leu Leu Thr Pro Val Thr Val Glu He Thr Gin Asn He Lys 
595 600 605 



Ser Thr Gin Val Ser Val Thr Asp Asp Leu Leu Lys Asp Ser Pro Ser 
610 615 620 

Ser Thr Asn Cys Glu Ser Lys Lys Arg Arg Thr Ala Ser Pro Pro Val 
625 €30 635 640 

Leu Pro Lys He Lys Thr Glu Thr Glu Ser Asp Ser Thr Ala Pro Ser 
645 650 655 

Cys Ser Leu Ser Leu Pro Leu Ser He Ser Thr Ala Glu Val Val Ser 
660 665 670 

Phe His Lys Glu Lys Gly Val Tyr Leu Ser Ser Lys Leu Lys Gin Leu 
675 680 685 

Leu Gin Thr Gin Asp Lys Leu Thr Leu Pro Ala Gly Phe Ser. Ala Ala 
690 695 700 

Glu He Pro Lys Leu Gly Pro Val Cys Ala Ser Ala Pro Ala Ser Met 
705 ' 710 715 720 

Leu Pro val Thr Ser Ser Arg Phe Lys Arg Arg Thr Ser Ser Pro Pro 
725 * 730 735 

Ser Ser Pro Gin His Ser Pro Ala Leu Arg Asp Phe Gly Lys Pro Asn 
740 745 750 

Asp Gly Lys Ala Ala Trp Thr Asp Thr Val Leu Thr Ser Lys Lys Pro 
755 760 765 

Lys Leu Glu Ser Arg Ser Asp Ser Pro Ala Trp Ser Leu Ser Gly Arg 
770 775 780 

Asp Glu Arg Glu Thr Gly Ser Pro Pro Cys Phe Asp Glu Tyr Lys lie 
785 790 795 800 

Ser Lys Glu Trp Ala Ala Ser Ser Thr Phe Ser Ser Val Cys Asn Gin 
805 810 815 

Gin Pro Leu Asp Leu Ser Ser Gly Val Lys Gin Lys Ser Glu Gly Thr 
820 825 830 

Gly Lys Thr Pro Val Pro Trp Glu Ser Val Leu Asp Leu Ser Val His 
835 840 84S 

Lys Lys Pro Cys Asp Ser Glu Gly Lys Glu Phe Lys Glu Asn His Leu 
850 855 860 

Ala Gin Pro Ala Ala Lys Lys Lys Lys Pro Thr Thr Cys Met Leu Gin 
865 870 875 »80 

Lys val Leu Leu Asn Glu Tyr Asn Gly Val Ser Leu Pro Thr Glu Thr 
885 890 895 

Thr Pro Glu Val Thr Arg Ser Pro Ser Pro Cys Lys Ser Pro Asp Thr 
900 90S 910 

Gin Pro Asp Pro Glu Leu Gly Pro Asp Ser Ser Cys Ser Val Pro Thr 
915 920 925 

Ala Glu Ser Pro Pro Glu Val Val Gly Pro Ser Ser Pro Pro Leu Gin 
930 935 940 

Thr Ala Ser Leu Ser Ser Gly Gin Leu Pro Pro Leu Leu Thr Pro Thr 
945 950 955 960 



±6 



7 



Glu Pro Ser Ser Pro Pro Pro Cys Pro Pro Val Leu Thr Val Ala Thr 
965 970 975 

Pro Pro Pro Pro Leu Leu Pro Thr Val Pro Leu Ser His Pro Ser Ser 
980 985 990 

Asp Ala Ser Pro Gin Gin Cys Pro Ser Pro Phe Ser Asn Thr Thr Ala 
995 1000 1005 

Gin Ser Pro Leu Pro He Leu Ser Pro Thr Val Ser Pro Ser Pro Ser 
1010 1015 1020 

Pro He Pro Pro Val Glu Pro Leu Met Ser Ala Ala Ser Pro. Gly Pro 
1025 1030 1035 1040 

Pro Thr Leu Ser Ser Ser Ser Ser Ser Ser Ser Ser Phe Pro Ser Ser 
1045 1050 1055 

Ser Cys Ser Ser Thr Ser Pro Ser Pro Pro Pro Leu Ser Ala Val Ser 
1060 1065 1070 

Ser Val Val Ser Ser Gly Asp Asn Leu Glu Ala Ser Leu Pro Ala Val 
1075 1080 1085 

Thr Phe Lys Gin Glu Glu Ser Glu Ser Glu Gly Leu Lys Pro Lys Glu 
1090 1095 1100 

Glu Ala Pro Pro Ala Gly Gly Gin Ser Val Val Gin Glu Thr Phe Ser 
1105 1110 1115 1120 

Lys Asn Phe He Cys Asn Val Cys Glu Ser Pro Phe Leu Ser He Lys 
1125 1130 1135 

Asp Leu Thr Lys His Leu Ser Val His Ala Glu Glu Trp Pro Phe Lys 
1140 1145 1150 

Cys Glu Phe Cys Val Gin Leu Phe Lys Val Lys Thr Asp Leu Ser Glu 
1155 1160 1165 

His Arg Phe Leu Leu His Gly Val Gly Asn He Phe Val Cys Ser Val 
1170 1175 1180 

Cys Lys Lys Glu Phe Ala Phe Leu Cys Asn Leu Gin Gin His Gin Arg 
1185 1190 113S 1200 

Asp Leu His Pro Asp Glu Val Cys Thr His His Glu Phe Glu Ser Gly 
1205 1210 1215 

Thr Leu Arg Pro Gin Asn Phe Thr Asp Pro Ser Lys Ala Asn Val Glu 
1220 1225 1230 

His Met Pro Ser Leu Pro Glu Glu Pro Leu Glu Thr Ser Arg Glu Glu 
1235 1240 1245 

Glu Leu Asn Asp Ser Ser Glu Glu Leu Tyr Thr Thr He Lys He Met 
1250 1255 1260 

Ala Ser Gly He Lys Thr Lys Asp Pro Asp Val Arg Leu Gly Leu Asn 
1265 1270 1275 1280 

Gin His Tyr Pro Ser Phe Lys Pro Pro Pro Phe Gin Tyr His His Arg 
1285 1290 1295 

Asn Pro Met Gly He Gly Val Thr Ala Thr Asn Phe Thr Thr His Asn 
1300 1305 1310 



He Pro Gin Thr Phe Thr Thr Ala lie Arg Cys Thr Lys Cys Gly Lys 
1315 1320 1325 

Gly Val Asp Asn Met Pro Glu Leu His Lys His He. Leu Ala Cys Ala 
1330 1335 1340 

Ser Ala Ser Asp Lys Lys Arg Tyr Thr Pro Lys Lys Asn Pro Val Pro 
1345 1350 1355 1360 

Leu Lys Gin Thr Val Gin Pro Lys Asn Gly Val Val Val Leu Asp Asn 
1365 1370 1375 

Ser Gly Lys Asn Ala Phe Arg Arg Met Gly Gin Pro Lys Arg Leu Ser 
1380 1385 1390 

Phe Asn Val Glu Leu Gly Lys Met Ser Pro Asn Lys Leu Lys Leu Ser 
1395 1400 1405 

Ala Leu Lys Lys Lys Asn Gin Leu Val Gin Lys Ala lie Leu Gin Lys 
1410 1415 1420 

Asn Arg Ala Ala Lys Gin Lys Ala Asp Leu Arg Asp Thr Ser Glu Ala 
1425 1430 1435 1440 

Ser Ser His lie Cys Pro Tyr Cys Asp Arg Glu Phe Thr Tyr He Gly 
1445 1450 14S5 

Ser Leu Asn Lys His Ala Ala Phe Ser Cys Pro Lys Lys Pro Leu Ser 
1460 1465 1470 

Pro Ser Lys Arg Lys Val Ser His Ser Ser Lys Lys Gly Gly His Ala 
1475 1480 1485 

Ser Ser Ser Ser Ser Asp Arg Asn Ser Ser Cys His Pro Arg Arg Arg 
1490 1495 1500 

Thr Ala Asp Thr Glu He Lys Met Gin Ser Thr Gin Ala Pro Leu Gly 
1505 1510 ISIS 1520 

Lys Thr Arg Ala Arg Ser Thr Gly Pro Ala Gin Ala Ser Leu Pro Ser 
1525 1530 1535 

Ser Ser Phe Arg Ser Arg Gin Asn Val Lys Phe Ala Ala Ser Val Lys 
1540 1545 1550 

Ser Lys Lys % Ala Ser Ser Ser Ser Leu Arg Asn Ser Ser Pro He Arg 
1555 1560 1565 

Met Ala Lys He Thr His Val Glu Gly Lys Lys Pro Lys Ala Val Ala 
1570 157S 1580 

Lys Ser His Ser Ala Gin Leu Ser Ser Lys Ser Ser Arg Gly Leu His 
1585 1590 1595 1600 

Val Arg Val Gin Lys Ser Lys Ala Val He Gin Ser Lys Thr Ala Leu 
1605 1610 1615 

Ala Ser Lys Arg Arg Thr Asp Arg Phe He Val Lys Ser Arg Glu Arg 
1620 1625 1630 

Ser Gly Gly Pro He Thr Arg Ser Leu Gin Leu Ala Ala Ala Ala Asp 
1635 1640 1645 

Leu Ser Glu Ser Arg Arg Glu Asp Ser Ser Ala Arg His Glu Leu Lys 
1650 1655 1660 



^8 



I 

! 

Asp Phe Ser Tyr Ser Leu Arg Leu Ala Ser Arg Cys Gly Ser Ser Thr 
166S 1670 1675 "80 

Ala Ser Tyr lie Thr Arg Gin Cys Arg Lys Val Lys Ala Ala Ala Ala 
1685 1690 169S 

Thr Pro Phe Gin Gly Pro Phe Leu Lys. Glu 
1700 1705 

SEQ ID NO. 3: 

(i) mm®-- 

(a) 5868 ®ai*t (b) m®. 

(C) Wife: *X (D) Stts £tt 

(ii) '#tE 

(A) CDS (B) ft*: 121-5278 

GGAAAATTTA TTCCCACTGG AAATCAACAG AGCCATTTAC TATAAAACTT TAAAGGGTTC 

ATGTAATCAA AGAAGTTTCT TGTGTGTGTG TATCTTTACA GAACACAACA GGAATTGAAA 

ATG AAT CAG AAC ACT ACT GAG CCT GTG GCG GCC ACC GAG ACC CTG CCT 
Met £sn Gin Asn Thr Thr Glu Pro Val Ala Ala Thr Glu Thr Leu Ala 
5 10 15 



1 



GAG GTA CCC GAA CAT GTG CTG CGA GGA CTT CCG GAG GAA GTG AGG CTT 
Glu val Pro Glu His Val Leu Arg Gly Leu Pro Glu Glu Val Arg Leu 
20 25 30 

TTC CCT TCT GCT GTT GAC AAG ACC CGG ATT GGT GTC TGG GCC ACT AAA 
She Pro Ser Ala Val Asp Lys Thr Arg lie Gly Val Trp Ala Thr Lys 
3S 40 45 

CCA ATT TTA AAA GGG AAA AAA TTT GGG CCA TTT GTT GGT GAT AAG AAA 
Pro lie Leu Lys Gly Lys Lys Phe Gly Pro Phe Val Gly Asp Lys Lys 
50 ss 60 

AAA AGA TCT CAG GTT AAG AAT AAT GTA TAC ATG TGG GAG GTG TAT TAC 
Lys Arg Ser Gin Val Lys Asn Asn Val Tyr Met Trp Glu Val Tyr Tyr 



65 " 70 75 



CCA AAT TTG GGA TGG ATG TGC ATT GAT GCC ACT GAT CCA GAG AAG GGA 
m! Leu Gly Trp Met Cys lie Asp Ala Thr Asp Pro Glu Lys Gly 
85 90 * s 

AAC TGG CTG CGA TAT GTG AAT TGG GCT TGC TCA GGA GAA GAG CAA AAT 
£sn ?£ Leu Arg Tyr Val Asn Trp Ala Cys Ser Gly Glu Glu Gin Asn 



TTA TTC CCA CTG GAA ATC AAC AGA GCC ATT TAC TAT AAA ACT TTA AAG 
Leu Phe Pro Leu Glu He Asn Arg Ala He Tyr Tyr Lys Thr Leu Lys 

120 125 



IIS 



CCA ATC GCG CCG GGC GAG GAG CTC CTG GTC TGG TAC AAT GGG GAA GAC 
Pro He Ala Pro Gly Glu Glu Leu Leu Val Trp Tyr Asn Gly Glu Asp 

135 140 



60 
120 
168 

216 

264 

312 

360 

408 

456 

504 

552 



130 



AAC CCT GAG ATA GCA GCT GCG ATT GAG GAA GAG CGA GCC AGC GCC CGG 
Asn Pro Glu lie Ala Ala Ala lie Glu Glu Glu Arg Ala Ser Ala Arg 
145 150 155 160 

AGC AAG CGG AGC TCC CCC AAG AGC CGG AAA GGG AAG AAA AAA TCC CAG 
Ser Lys Arg Ser Ser Pro Lys Ser Arg Lys Gly Lys Lys Lys Ser Gin 
165 170 175 

GAA AAT AAA AAC AAA GGA AAC AAA ATC CAA GAC ATA CAA CTG AAG ACA 
Glu Asn Lys Asn Lys Gly Asn Lys He Gin Asp He Gin Leu Lys Thr 
180 18S 190 

AGT GAG CCA GAT TTC ACC TCT GCA AAT ATG AGA GAT TCT GCA GAA GGT 
Ser Glu Pro Asp Phe Thr Ser Ala Asn Met Arg Asp Ser Ala Glu Gly 
195 200 205 

CCT AAA GAA GAC GAA GAG AAG CCT TCA GCC TCA GCA CTT GAG CAG CCG 
Pro Lys Glu Asp Glu Glu Lys Pro Ser Ala Ser Ala Leu Glu Gin Pro 
210 215 220 

GCC ACC CTC CAG GAG GTG GCC AGT CAG GAG GTG CCT CCA GAA CTA GCA 
Ala Thr Leu Gin Glu Val Ala Ser Gin Glu Val Pro Pro Glu Leu Ala 
225 230 235 240 

ACC CCT GCC CCT GCC TGG GAG CCA CAG CCA GAA CCA GAC GAG CGA TTA 
Thr Pro Ala Pro Ala Trp Glu Pro Gin Pro Glu Pro Asp Glu Arg Leu 
245 250 255 

GAA GCG GCA GCT TGT GAG GTG AAT GAT TTG GGG GAA GAG GAG GAG GAG 
Glu Ala Ala Ala Cys Glu Val Asn Asp Leu Gly Glu Glu Glu Glu Glu 
260 265 270 

GAA GAG GAG GAG GAT GAA GAA GAA GAA GAA GAT GAT GAT GAT GAT GAG 
Glu Glu Glu Glu Asp Glu Glu Glu Glu Glu Asp Asp Asp Asp Asp Glu 
275 280 285 

TTG GAA GAC GAG GGG GAA GAA GAA GCC AGC ATG CCA AAT GAA AAT TCT 
Leu Glu Asp Glu Gly Glu Glu Glu Ala Ser Met Pro Asn Glu Asn Ser 
290 295 300 

GTG AAA GAG CCA GAA ATA CGG TGT GAT GAG AAG CCA GAA GAT TTA TTA 
Val Lys Glu Pro Glu He Arg Cys Asp Glu Lys Pro Glu Asp Leu Leu 
305 310 315 320 

GAG GAA CCA AAA ACA ACT TCA GAA GAA ACT CTT GAA GAC TGC TCA GAG 
Glu Glu Pro* Lys Thr Thr Ser Glu Glu Thr Leu Glu Asp Cys Ser Glu 
325 330 335 

GTA ACA CCT GCC ATG CAA ATC CCC AGA ACT AAA GAA GAG GCC AAT GGT 
Val Thr Pro Ala Met Gin He Pro Arg Thr Lys Glu Glu Ala Asn Gly 
340 345 350 

GAT GTA TTT GAA ACG TTT ATG TTT CCG TGT CAA CAT TGT GAA AGG AAG 
Asp Val Phe Glu Thr Phe Met Phe Pro Cys Gin His Cys Glu Arg Lys 
355 360 365 

TTT ACA ACC AAA CAG GGG CTT GAG CGT CAC ATG CAT ATC CAT ATA TCC 
Phe Thr Thr Lys Gin Gly Leu Glu Arg His Met His He His He Ser 
370 375 380 

ACC GTC AAT CAT GCT TTC AAA TGC AAG TAC TGT GGG AAA GCC TTT GGC 
Thr Val Asn His Ala Phe Lys Cys Lys Tyr Cys Gly Lys Ala Phe Gly 
385 390 395 400 



ACA CAG ATT AAC CGG CGG CGA CAT GAG CGG CGC CAT GAA GCA GGG TTA 
Thr Gin He Asn Arg Arg Arg His Glu Arg Arg His Glu Ala Gly Leu 
405 410 415 



AAG CGG AAA CCC AGC CAA ACA CTA CAG CCG TCA GAG GAT CTG GCT GAT 
Lys Arg Lys Pro Ser Gin Thr Leu Gin Pro Ser Glu Asp Leu Ala Asp 
420 425 430 



GGC AAA GCA TCT GGA GAA AAC GTT GCT TCA AAA GAT GAT TCG AGT CCT 
Gly Lys Ala Ser Gly Glu Asn Val Ala Ser Lys Asp Asp Ser Ser Pro 
435 440 44S 

CCC AGT CTT GGG CCA GAC TGT CTG ATC ATG AAT TCA GAG AAG GCT TCC 
Pro Ser Leu Gly Pro Asp Cys Leu lie Met Asn Ser Glu Lys Ala Ser 
450 ' 455 460 

CAA GAC ACA ATA AAT TCT TCT GTC GTA GAA GAG AAT GGG GAA GTT AAA 
Gin Asp Thr lie Asn Ser Ser Val Val Glu Glu Asn Gly Glu Val Lys 
465 470 475 480 

GAA CTT CAT CCG TGC AAA TAT TGT AAA AAG GTT TTT GGA ACT CAT ACT 
Glu Leu His Pro Cys Lys Tyr Cys Lys Lys Val Phe Gly Thr His Thr 
485 490 495 

AAT ATG AGA CGG CAT CAG CGT AGA GTT CAC GAA CGT CAT CTG ATT CCC 
Asn Met Arg Arg His Gin Arg Arg Val His Glu Arg His Leu He Pro 
500 505 510 

AAA GGT GTA CGG CGA AAA GGA GGC CTT GAA GAG CCC CAG CCT CCA GCA 
Lys Gly Val Arg Arg Lys Gly Gly Leu Glu Glu Pro Gin Pro Pro Ala 
515 520 525 

GAA CAG GCC CAG GCC ACC CAG AAC GTG TAT GTA CCA AGC ACA GAG CCG 
Glu Gin Ala Gin Ala Thr Gin Asn Val Tyr Val Pro Ser Thr Glu Pro 
530 535 540 

GAG GAG GAA GGG GAA GCA GAT GAT GTG TAC ATC ATG GAC ATT TCT AGC 
Glu Glu Glu Gly Glu Ala Asp Asp Val Tyr He Met Asp He Ser Ser 
545 550 555 560 

AAT ATC TCT GAA AAC TTA AAT TAC TAT ATT GAT GGT AAA ATT CAA ACT 
Asn He Ser Glu Asn Leu Asn Tyr Tyr He Asp Gly Lys He Gin Thr 
565 570 575 

AAT AAC AAC ACT AGT AAC TGT GAT GTG ATT GAG ATG GAG TCT GCT TCG 
Asn Asn Asn Thr Ser Asn Cys Asp Val He Glu Met Glu Ser Ala Ser 
580 585 590 

GCA GAT TTG TAT GGT ATA AAT TGT CTG CTC ACT CCA GTT ACA GTG GAA 
Ala Asp Leu Tyr Gly lie Asn Cys Leu Leu Thr Pro Val Thr Val Glu 
595 600 605 

ATT ACT CAA AAT ATA AAG ACC ACA CAG GTC CCT GTA ACA GAA GAT CTT 
He Thr Gin Asn He Lys Thr Thr Gin Val Pro Val Thr Glu Asp Leu 
610 615 620 

CCT AAA GAG CCT TTG GGC AGC ACA AAT AGT GAG GCC AAG AAG CGG AGA 
Pro Lys Glu Pro Leu Gly Ser Thr Asn Ser Glu Ala Lys Lys Arg Arg 
625 630 635 640 

ACT GCG AGC CCA CCT GCA CTG CCC AAA ATT AAG GCC GAA ACA GAC TCT 
Thr Ala Ser Pro Pro Ala Leu Pro Lys He Lys Ala Glu Thr Asp Ser 
645 650 655 

GAC CCC ATG GTC CCC TCT TGC TCT TTA AGT CTT CCT CTT AGC ATA TCA 
Asp Pro Met Val Pro Ser Cys Ser Leu Ser Leu Pro Leu Ser He Ser 
660 665 670 

ACA ACA GAG GCA GTG TCT TTC CAC AAA GAG AAA AGT GTT TAT TTG TCA 
Thr Thr Glu Ala Val Ser Phe His Lys Glu Lys Ser Val Tyr Leu Ser 
675 680 685 



5>J 



TCA AAG CTC AAA CAA CTT CTT CAA ACC CAA GAT AAA CTA ACT CCT CCT 2232 
Ser Lys Leu Lys Gin Leu Leu Gin Thr Gin Asp Lya Leu Thr- Pro Pro 
690 695 700 

GCA GGG ATT TCA GCA ACT GAA ATA GCT AAA TTA GGT CCT GTT TGT GTG 2280 
Ala Gly lie Ser Ala Thr Glu He Ala Lys Leu Gly Pro Val Cys Val 
705 710 715 720 

TCT GCT CCT GCA TCA ATG TTG CCT GTG ACC TCA AGT AGG TTT AAG AGG 2328 
Ser Ala Pro Ala Ser Met Leu Pro Val Thr Ser Ser Arg Phe Lys Arg 
725 730 " 735 

CGG ACC AGC TCT CCT CCC AGT TCT CCA CAG CAC AGT CCT GCC CTT CGA 2 376 

Arg Thr Ser Ser Pro Pro Ser Ser Pro Gin His Ser Pro Ala Leu Arg 
740 745 750 

GAC TTT GGA AAG CCA AGT GAT GGG AAA GCA GCA TGG ACC GAT GCC GGG 2424 
Asp Phe Gly Lys Pro Ser Asp Gly Lys Ala Ala Trp Thr Asp Ala Gly 
755 760 765 

CTG ACT TCC AAA AAA TCC AAA TTA GAA AGT CAC AGC GAC TCA CCA GCA 2472 
Leu Thr Ser Lys Lys Ser Lys Leu Glu Ser His Ser Asp Ser Pro Ala 
770 775 780 

TGG AGT TTG TCT GGG AGA GAT GAG AGA GAA ACT GTG AGC CCT CCA TGC 2520 
Trp Ser Leu Ser Gly Arg Asp Glu Arg Glu Thr Val Ser Pro Pro Cys 
785 790 79S 800 

TTT GAT GAA TAT AAA ATG TCT AAA GAG TGG ACA GCT AGT TCT GCT TTT 2 568 

Phe Asp Glu Tyr Lys Met Ser Lys Glu Trp Thr Ala Ser Ser Ala Phe 
805 810 815 

AGC AGT GTG TGC AAC CAG CAG CCA CTG GAT TTA TCC AGC GGT GTC AAA 2616 
Ser Ser Val Cys Asn Gin Gin Pro Leu Asp Leu Ser Ser Gly Val Lys 
820 825 830 

CAG AAG GCT GAG GGT ACA GGC AAG ACT CCA GTC CAG TGG GAA TCT GTC 2664 
Gin Lys Ala Glu Gly Thr Gly Lys Thr Pro Val Gin Trp Glu Ser Val 
835 840 845 

TTA GAT CTC AGT GTG CAT AAA AAG CAT TGT AGT GAC TCT GAA GGC AAG 2712 
Leu Asp Leu Ser Val His Lys Lys His Cys Ser Asp Ser Glu Gly Lys 
850 855 860 

GAA TTC AAA GAA AGT CAT TCA GTG CAG CCT ACG TGT AGT GCT GTA AAG 2760 
Glu Phe Lys Glu Ser His Ser Val Gin Pro Thr Cys Ser Ala Val Lys 
865 870 875 880 

AAA AGG AAA CCA ACC ACC TGC ATG CTG CAG AAG GTT CTT CTC AAT GAA 28 08 

Lys Arg Lys Pro Thr Thr Cys Met Leu Gin Lys Val Leu Leu Asn Glu 
885 890 895 

TAT AAT GGC ATC GAT TTA CCT GTA GAA AAC CCT GCA GAT GGG ACC AGG 2856 
Tyr Asn Gly He Asp Leu Pro Val Glu Asn Pro Ala Asp Gly Thr Arg 
900 905 910 

AGC CCA AGT CCT TGT AAA TCC CTA GAA GCT CAG CCA GAT CCT GAC CTC 2904 
Ser Pro Ser Pro Cys Lys Ser Leu Glu Ala Gin Pro Asp Pro Asp Leu 
915 920 925 

GGT CCG GGC TCT GGT TTC CCT GCC CCT ACT GTT GAG TCC ACA CCT GAT 2952 
Gly Pro Gly Ser Gly Phe Pro Ala Pro Thr Val Glu Ser Thr Pro Asp 
930 935 940 

GTT TGT CCT TCA TCA CCT GCC CTG CAG ACA CCC TCC CTT TCA TCC GGT 3000 
Val Cys Pro Ser Ser Pro Ala Leu Gin Thr Pro Ser Leu Ser Ser Gly 
945 950 955 960 



3x 



CAG CTG CCT CCT CTC TTG ATC CCC ACA GAT CCC TCT TCC CCT CCA CCC 
Gin Leu Pro Pro Leu Leu lie Pro Thr Asp Pro Ser Ser Pro Pro Pro 
965 970 975 

TGT CCC CCG GTA TTA ACT GTT GCC ACT CCG CCC CCT CCC CTC CTT CCT 
Cys Pro Pro Val Leu Thr Val Ala Thr Pro Pro Pro Pro Leu Leu Pro 
980 985 990 

ACC GTA CCT CTT CCA GCC CCC TCT TCC AGT GCA TCT CCA CAC CCA TGC 
Thr Val Pro Leu Pro Ala Pro Ser Ser Ser Ala Ser Pro His Pro Cys 
995 1000 1005 

CCC TCT CCA CTC TCA AAT GCC ACC GCA CAG TCC CCA CTT CCA ATT CTG 
Pro Ser Pro Leu Ser Asn Ala Thr Ala Gin Ser Pro Leu Pro He Leu 
1010 1015 1020 

TCC CCA ACA GTG TCC CCC TCT CCC TCT CCC ATT CCT CCC GTG GAG CCC 
Ser Pro Thr Val Ser Pro Ser Pro Ser Pro He Pro Pro Val Glu Pre 
1025 1030 1035 1040 

CTG ATG TCT GCC GCC TCA CCC GGG CCT CCA ACA CTT TCT TCT TCC TCC 
Leu Met Ser Ala Ala Ser Pro Gly Pro Pro Thr Leu Ser Ser Ser Ser 
1045 1050 1055 

TCT TCA TCT TCC TCC TCC TCT TCG TTT TCT TCT TCA TCT TCC TCC TCT 
Ser Ser Ser Ser Ser Ser Ser Ser Phe Ser Ser Ser Ser Ser Ser Ser 
1060 1065 1070 

TCT CCT TCT CCA CCT CCT CTC TCC GCA ATA TCA TCT GTT GTT TCC TCT 
Ser Pro Ser Pro Pro Pro Leu Ser Ala lie Ser Ser Val Val Ser Ser 
1075 1080 1085 

GGT GAT AAT CTG GAG GCT TCT CTC CCC ATG ATA TCT TTC AAA CAG GAG 
Gly Asp Asn Leu Glu Ala Ser Leu Pro Met lie Ser Phe Lys Gin Glu 
1090 1095 HOO 

GAA TTA GAG AAT GAA GGT CTG AAA CCC AGG GAA GAG CCC CAG TCT GCT 
Glu Leu Glu Asn Glu Gly Leu Lys Pro Arg Glu Glu Pro Gin Ser Ala 
1105 1110 HIS H20 

GCT GAA CAG GAT GTT GTT GTT CAG GAA ACA TTC AAC AAA AAC TTT GTT 
Ala Glu Gin Asp Val Val Val Gin Glu Thr Phe Asn Lys Asn Phe Val 
1125 1130 H35 

TGC AAC GTC TGT GAA TCA CCT TTT CTT TCC ATT AAA GAT CTA ACC AAA 
Cys Asn Val tys Glu Ser Pro Phe Leu Ser He Lys Asp Leu Thr Lys 
1140 1145 H50 

CAT TTA TCT ATT CAT GCT GAA GAA TGG CCC TTC AAA TGT GAA TTT TGT 
His Leu Ser He His Ala Glu Glu Trp Pro Phe Lys Cys Glu Phe Cys 
1155 1160 - H65 

GTG CAG CTT TTT AAG GAT AAA ACG GAC TTG TCA GAA CAT CGC TTT TTG 
Val Gin Leu Phe Lys Asp Lys Thr Asp Leu Ser Glu His Arg Phe Leu 
1170 H75 H60 



3048 



3096 



3144 



3192 



3240 



3288 



3336 



3384 



3432 



3480 



3528 



3576 



3624 



3672 



CTT CAT GGA GTT GGG AAT ATC TTT GTG TGT TCT GTT TGT AAA AAA GAA 
Leu His Gly Val Gly Asn He Phe val Cys Ser Val Cys Lys Lys Glu 
1185 ' H90 H95 1200 



3720 



TTT GCT TTT TTG TGC AAT TTG CAG CAG CAC CAG CGA GAT CTC CAC CCA 

Phe Ala Phe Leu Cys Asn Leu Gin Gin His Gin Arg Asp Leu His Pro 
1205 1210 121S 

GAT AAG GTG TGC ACA CAT CAC GAG TTT GAA AGC GGG ACT CTG AGG CCC 

Asp Lys Val Cys Thr His His Glu Phe Glu Ser Gly Thr Leu Arg Pro 

1220 1225 1230 



3768 



3816 



CAG AAC TTT ACA GAT CCC AGC AAG GCC CAT GTA GAG CAT ATG CAG AGC 3 864 

Gin Asn Phe Thr Asp Pro Ser Lys Ala His Val Glu His Met Gin Ser 
1235 1240 1245 

TTG CCA GAA GAT CCT TTA GAA ACT TCT AAA GAA GAA GAG GAG TTA AAT 3912 
Leu Pro Glu Asp Pro Leu Glu Thr Ser Lys Glu Glu Glu Glu Leu Asn 
1250 1255 1260 

GAT TCC TCT GAA GAG CTT TAC ACG ACT ATA AAA ATA ATG GCT TCT GGA 3960 
Asp Ser Ser Glu Glu Leu Tyr Thr Thr lie Lys He Met Ala Ser Gly 
1265 1270 1275 1280 

ATA AAG ACA AAA GAT CCA GAT GTT CGA TTG GGC CTC AAT CAG CAT TAC 4 008 

He Lys Thr Lys Asp Pro Asp Val Arg Leu Gly Leu Asn Gin His Tyr 
1285 1290 1295 

CCA AGC TTT AAA CCA CCT CCA TTT CAG TAC CAT CAC CGT AAC CCC ATG 4056 
Pro Ser Phe Lys Pro Pro Pro Phe Gin Tyr His His Arg Asn Pro Met 
1300 1305 1310 

GGG ATT GGT GTG ACA GCC ACA AAT TTC ACT ACA CAC AAT ATT CCA CAG 4104 
Gly He Gly Val Thr Ala Thr Asn Phe Thr Thr His Asn He Pro Gin 
1315 1320 1325 

ACT TTC ACT ACC GCC ATT CGC TGC ACA AAG TGT GGA AAA GGT GTC GAC 4152 
Thr Phe Thr Thr Ala He Arg Cys Thr Lys Cys Gly Lys Gly Val Asp 
1330 1335 1340 

AAT ATG CCG GAG TTG CAC AAA CAT ATC CTG GCT TGT GCT TCT GCA AGT 4 200 

Asn Met Pro Glu Leu His Lys His He Leu Ala Cys Ala Ser Ala Ser 
1345 1350 1355 1360 

GAC AAG AAG AGG TAC ACG CCT AAG AAA AAC CCA GTA CCA TTA AAA CAA 4248 
Asp Lys Lys Arg Tyr Thr Pro Lys Lys Asn Pro Val Pro Leu Lys Gin 
1365 1370 1375 

ACT GTG CAA CCC AAA AAT GGC GTG GTG GTT TTA GAT AAC TCT GGG AAA 4296 
Thr val Gin Pro Lys Asn Gly Val Val Val Leu Asp Asn Ser Gly Lys 
1380 1385 1390 

AAT GCC TTC CGA CGA ATG GGA CAG CCC AAA AGG CTT AAC TTT AGT GTT 4344 
Asn Ala Phe Arg Arg Met Gly Gin Pro Lys Arg Leu Asn Phe Ser Val 
1395 1400 1405 

GAG CTC AGC AAA ATG TCG TCG AAT AAG CTC AAA TTA AAT GCA TTG AAG 4392 
Glu Leu Ser Lys Met Ser Ser Asn Lys Leu Lys Leu Asn Ala Leu Lys 
1410 1415 1420 

AAA AAA AAT CAG CTA GTA CAG AAA GCA ATT CTT CAG AAA AAC AAA TCT 444 0 

Lys Lys Asn Gin Leu Val Gin Lys Ala He Leu Gin Lys Asn Lys Ser 
1425 1430 1435 1440 

GCA AAG CAG AAG GCC GAC TTG AAA AAT GCT TGT GAG TCA TCC TCT CAC 4488 
Ala Lys Gin Lys Ala Asp Leu Lys Asn Ala Cys Glu Ser Ser Ser His 
1445 1450 ^SS 

ATC TGC CCT TAC TGT AAT CGA GAG TTC ACT TAC ATT GGA AGC CTG AAT 4 536 

He Cys Pro Tyr Cys Asn Arg Glu Phe Thr Tyr He Gly Ser Leu Asn 
1460 * 1465 14^0 

AAA CAC GCC GCC TTC AGC TGT CCC AAA AAA CCC CTT TCT CCT CCC AAA 4584 
Lys His Ala Ala Phe Ser Cys Pro Lys Lys Pro Leu Ser Pro Pro Lys 
1475 1480 1485 

AAA AAA GTT TCT CAT TCA TCT AAG AAA GGT GGA CAC TCA TCA CCT GCA 4632 
Lys Lys Val Ser His Ser Ser Lys Lys Gly Gly His Ser Ser Pro Ala 
1490 1495 1500 



AGT AGT GAC AAA AAC AGT AAC AGC AAC CAC CGC AGA CGG ACA GCG GAT 
Ser Ser Asp Lys Asn Ser Asn Ser Asn His Arg Arg Arg Thr Ala Asp 
1505 1510 151S 1520 

GCG GAG ATT AAA ATG CAA AGC ATG CAG ACT CCG TTG GGC AAG ACC AGA 
Ala Glu He Lys Met Gin Ser Met Gin Thr Pro Leu Gly Lys Thr Arg 
1525 1530 1535 

GCC CGC AGC TCA GGC CCC ACC CAA GTC CCA CTT CCC TCC TCA TCC TTC 
Ala Arg Ser Ser Gly Pro Thr Gin Val Pro Leu Pro Ser Ser Ser Phe 
1540 1545 1550 

AGG TCC AAG CAG AAC GTC AAG TTT GCA GCT TCG GTG AAA TCC AAA AAA 
Arg Ser Lys Gin Asn Val Lys Phe Ala Ala Ser Val Lys Ser Lys Lys 
1555 1560 1565 

CCA AGC TCC TCC TCT TTA AGG AAC TCC AGC CCG ATA AGA ATG GCC AAA 
Pro Ser Ser Ser Ser Leu Arg Asn Ser Ser Pro He Arg Met Ala Lys 
1570 1575 1580 

ATA ACT CAT GTT GAG GGG AAA AAA CCT AAA GCT GTG GCC AAG AAT CAT 
He Thr His Val Glu Gly Lys Lys Pro Lys Ala Val Ala Lys Asn His 
1585 1590 1595 1600 

TCT GCT CAG CTT TCC AGC AAA ACA TCG CGG AGC CTG CAC GTG AGG GTA 
Ser Ala Gin Leu Ser Ser Lys Thr Ser Arg Ser Leu His Val Arg Val 
1605 1610 1615 

CAG AAA AGC AAA GCT GTT TTA CAA AGC AAA TCC ACC TTG GCG AGT AAG 
Gin Lys Ser Lys Ala Val Leu Gin Ser Lys Ser Thr Leu Ala Ser Lys 
1620 162S 1630 

AAA AGA ACA GAC CGG TTC AAT ATA AAA TCT AGA GAG CGG AGT GGG GGG 
Lys Arg Thr Asp Arg Phe Asn He Lys Ser Arg Glu Arg Ser Gly Gly 
1635 1640 1645 

CCA GTC ACC CGG AGC CTT CAG CTG GCA GCT GCT GCT GAC TTG AGT GAG 
Pro Val Thr Arg Ser Leu Gin Leu Ala Ala Ala Ala Asp Leu Ser Glu 
1650 1655 1660 

AAC AAG AGA GAG GAC GGC AGC GCC AAG CAG GAG CTG AAG GAC TTC AGC 
Asn Lys Arg Glu Asp Gly Ser Ala Lys Gin Glu Leu Lys Asp Phe Ser 
1665 1670 1675 1680 

TAC AGC CTC CGC TTG GCG TCC CGA TGC TCT CCA CCA GCC GCC TCT TAC 
Tyr Ser Leu Arg Leu Ala Ser Arg Cys Ser Pro Pro Ala Ala Ser Tyr 
1685 1690 1695 

ATC ACC AGG CAG TAT AGG AAG GTC AAA GCT CCG GCT GCA GCC CAG TTC 
He Thr Arg Gin Tyr Arg Lys Val Lys Ala Pro Ala Ala Ala Gin Phe 
1700 " 1705 1710 

CAG GGA CCA TTC TTC AAA GAG T AGACACTCTG GCTGCTCCCT GACAGCACCT 
Gin Gly Pro Phe Phe Lys Glu 
1715 

GAAGTGACCT GGAATCAGTG AAGCCAAAGG GACTGGCAGT CTGCCTGCAG GGAGTACCGA 
CCTATCCCAG TTGTGTGAGG CTGCGAGAGA AAGGGAGTGC ATGTGCGCGC GTGCATGTGT 
GCGTGCGTGT GTGTTCACGT GTTCTCGTGC GGGCCGTGAG TGGTCTTCAA ACGAGGGTCC 
CGAACCCCGG GGCGGCAGGA AGGGGGCCGA CTCCACGCTG TCCTTTGGGA TGATACTTGG 
ATCTCAGCTC TTGGGACCGT GTCTGCAGCC CAGCCTTCCT GTTGGGGTGG GGCCTCTCCT 
ACTATGCAAT TTTTCAAGAG CTCCTTGACC CTGCTTTTTG CTTCTTGAGT TGTCTTTTGC 



4680 

4728 

4776 

4824 

4872 

4920 

4968 

5016 

5064 

5112 

5160 

5208 

5256 

5308 

5368 
5428 
5488 
5548 
5608 
5668 



3Jr 



5868 



CATTATGGGG ACTTTCGTTT GACCCAGGGG TCAGCCCTTT AGGAAGGCCT TCAGGAGGAG 5728 

GCCGAGTTCC CCTTCAGTAC CACCCCTCTC TCCCCACCTG CCCGCTCCCG GCAACATCTC S788 

TGGGAATCAA CAGCATATTG ACACGTTGGA GCCGAGCCTG AACATGCCCT GACCCCAGCA S84 8 
CATGGAAAAC CCCCTTCCTT 

SEQ ID NO. 4: 
. (i) ffMftffi: 

(a) Kfts ms&mm (b) mm-, urn® 

(D) ttft 8 
. (ii) ^HSffli gfijg 

Met Aan Gin Asn Thr Thr Glu Pro Val Ala Ala Thr Glu Thr Leu Ala 
1 5 io is 

Glu Val' Pro Glu His Val Leu Arg Gly Leu Pro Glu Glu Val Arg Leu 
20 25 30 : 

Phe Pro Ser Ala Val Asp Lys Thr Arg He Gly Val Trp Ala Thr Lys 
35 40 4S 

Pro lie Leu Lys Gly Lys Lys Phe Gly Pro Phe Val Gly Asp Lys Lys 
50 55 60 

Lys Arg Ser Gin Val Lys Asn Asn Val Tyr Met Trp Glu Val Tyr Tyr 
65 70 75 80 

Pro Asn Leu Gly Trp Met Cys He Asp Ala Thr Asp Pro Glu Lys Gly 
85 90 9s 

Asn Trp Leu Arg Tyr Val Asn Trp Ala Cys Ser Gly Glu Glu Gin Asn 
100 105 no 

Leu Phe Pro Leu Glu He Asn Arg Ala He Tyr Tyr Lys Thr Leu Lvs 
115 120 12 5 

Pro He Ala Pro Gly Glu Glu Leu Leu Val Trp Tyr Asn Gly Glu Asp 
130 135 140 

Asn Pro Glu He Ala Ala Ala He Glu Glu Glu Arg Ala Ser Ala Arq 
14S 150 155 160 

Ser Lye Arg Ser Ser Pro Lys Ser Arg Lys Gly Lys Lys Lys Ser Gin 
165 170 175 

Glu Asn Lys Asn Lys Gly Asn Lys He Gin Asp He Gin Leu. Lys Thr 
180 185 190 

Ser Glu Pro Asp Phe Thr Ser Ala Asn Met Arg Asp Ser Ala Glu Gly 
195 200 205 

Pro Lys Glu Asp Glu Glu Lys Pro Ser Ala Ser Ala Leu Glu Gin Pro 
210 215 220 

Ala Thr Leu Gin Glu Val Ala Ser Gin Glu Val Pro Pro Glu Leu Ala 
225 230 235 240 

Thr Pro Ala Pro Ala Trp Glu Pro Gin Pro Glu Pro Asp Glu Arg Leu 
245 250 ■ 255 



Glu Ala Ala Ala Cys Glu Val Asn Asp Leu Gly Glu Glu Glu Glu Glu 
260 265 270 

Glu Glu Glu Glu Asp Glu Glu Glu Glu Glu Asp Asp Asp Asp Asp Glu 
275 280 * 28 5 

Leu Glu Asp Glu Gly Glu Glu Glu Ala Ser Met Pro Asn Glu Asn Ser 
290 295 300 

Val Lys Glu Pro Glu lie Arg Cys Asp Glu Lys Pro Glu Asp Leu Leu 
305 310 315 320 

Glu Glu Pro Lys Thr Thr Ser Glu Glu Thr Leu Glu Asp Cys Ser Glu 
325 330 335 

Val Thr Pro Ala Met Gin lie Pro Arg Thr Lys Glu Glu Ala Asn Gly 
340 345 350 

Asp Val Phe Glu Thr Phe Met Phe Pro Cys Gin His Cys Glu Arg Lys 
355 360 365 

Phe Thr Thr Lys Gin Gly Leu Glu Arg His Met His He His He Ser 
370 375 380 

Thr Val Asn His Ala Phe Lys Cys Lys Tyr Cys Gly Lys Ala Phe Gly 
385 3 90 395 4 00 

Thr Gin He Asn Arg Arg Arg His Glu Arg Arg His Glu Ala Gly Leu 
405 410 415 

Lys Arg Lys Pro Ser Gin Thr Leu Gin Pro Ser Glu Asp Leu Ala Asp 
42 0 425 430 

Gly Lys Ala Ser Gly Glu Asn Val Ala Ser Lys Asp Asp Ser Ser Pro 
435 440 445 

Pro Ser Leu Gly Pro Asp Cys Leu He Met Asn Ser Glu Lys Ala Ser 
4S <> 455 460 

Gin Asp Thr He Asn Ser Ser Val Val Glu Glu Asn Gly Glu Val Lys 
465 470 475 480 

Glu Leu His Pro Cys Lys Tyr Cys Lys Lys Val Phe Gly Thr His Thr 
485 490 495 

Asn Met Arg Arg His Gin Arg Arg Val His Glu Arg His Leu He Pro 
500 505 " 510 

Lys Gly Val Arg Arg Lys Gly Gly Leu Glu Glu Pro Gin Pro Pro Ala 
515 520 525 

Glu Gin Ala Gin Ala Thr Gin Asn Val Tyr Val Pro Ser Thr Glu Pro 
5 3 0 535 5 4 0 

Glu Glu Glu Gly Glu Ala Asp Asp Val Tyr lie Met Asp He Ser Ser 
545 S50 S55 * 560 

Asn He Ser Glu Asn Leu Asn Tyr Tyr He Asp Gly Lys He Gin Thr 
565 570 575 

Asn Asn Asn Thr Ser Asn Cys Asp Val He Glu Met Glu Ser Ala Ser 
580 S85 590 

Ala Asp Leu Tyr Gly il e A sn Cys Leu Leu Thr Pro Val Thr Val Glu 
595 600 60S 



lie Thr Gin Asn lie Lys Thr Thr Gin Val Pro Val Thr Glu Asp Leu 
610 61S 620 

Pro Lys Glu Pro Leu Gly Ser Thr Asn Ser Glu Ala Lys Lys Arg Arg 
625 630 635 640 

Thr Ala Ser Pro Pro Ala Leu Pro Lys lie Lys Ala Glu Thr Asp Ser 
645 650 655 

Asp Pro Met Val Pro Ser Cys Ser Leu Ser Leu Pro Leu Ser lie Ser 
660 665 670 

Thr Thr Glu Ala val Ser Phe His Lys Glu Lys Ser Val Tyr Leu Ser 
675 660 685 

Ser Lys Leu Lys. Gin Leu Leu Gin Thr Gin Asp Lys Leu Thr Pro Pro 
690 695 700 

Ala Gly lie Ser Ala Thr Glu He Ala Lys Leu Gly Pro Val Cys Val 
705 710 715 720 

Ser Ala Pro Ala Ser Met Leu Pro Val Thr Ser Ser Arg Phe Lys Arg 
725 730 735 

Arg Thr Ser Ser Pro Pro Ser Ser Pro Gin His Ser Pro Ala Leu Arg 
740 745 750 

Asp Phe Gly Lys Pro Ser Asp Gly Lys Ala Ala Trp Thr Asp Ala Gly 
755 760 765 

Leu Thr Ser Lys Lys Ser Lys Leu Glu Ser His Ser Asp Ser .Pro Ala 
770 775 780 

Trp Ser Leu Ser Gly Arg Asp Glu Arg Glu Thr Val Ser Pro Pro Cys 
785 790 795 800 

Phe Asp Glu Tyr Lys Met Ser Lys Glu Trp Thr Ala Ser Ser Ala Phe 
805 810 815 

Ser Ser Val Cys Asn Gin Gin Pro Leu Asp Leu Ser Ser Gly Val Lys 
820 825 830 

Gin Lys Ala Glu Gly Thr Gly Lys Thr Pro Val Gin Trp Glu Ser Val 
83S 840 845 

Leu Asp Leu Ser Val His Lys Lys His Cys Ser Asp Ser Glu Gly Lys 
850 855 860 

Glu Phe Lys Glu Ser His Ser Val Gin Pro Thr Cys Ser Ala Val Lys 
865 870 875 880 

Lys Arg Lys Pro Thr Thr Cys Met Leu Gin Lys Val Leu Leu Asn Glu 
885 890 895 

Tyr Asn Gly He Asp Leu Pro Val Glu Asn Pro Ala Asp Gly Thr Arg 
900 905 910 

Ser Pro Ser Pro Cys Lys Ser Leu Glu Ala Gin Pro Asp Pro Asp Leu 
915 920 925 

Gly Pro Gly Ser Gly Phe Pro Ala Pro Thr Val Glu Ser Thr Pro Asp 
930 935 940 

Val Cys Pro Ser Ser Pro Ala Leu Gin Thr Pro Ser Leu Ser Ser Gly 
945 950 955 960 




Gin Leu Pro Pro Leu Leu lie Pro Thr Asp Pro Ser Ser Pro Pro Pro 
965 970 975 

Cys Pro Pro Val Leu Thr Val Ala Thr Pro Pro Pro Pro Leu Leu Pro 
980 985 990 

Thr Val Pro Leu Pro Ala Pro Ser Ser Ser Ala Ser Pro His Pro Cys 
995 1000 1005 

Pro Ser Pro Leu Ser Asn Ala Thr Ala Gin Ser Pro Leu Pro lie Leu 

1010 1015 1020 

Ser Pro Thr Val Ser Pro Ser Pro Ser Pro lie Pro Pro Val Glu Pro 
1025 1030 1035 1040 

Leu Met Ser Ala Ala Ser Pro Gly Pro Pro Thr Leu Ser Ser Ser Ser 
1045 1050 1055 

Ser Ser Ser Ser Ser Ser Ser Ser Phe Ser Ser Ser Ser Ser Ser Ser 
1060 1065 1070 

Ser Pro Ser Pro Pro Pro Leu Ser Ala lie Ser Ser Val Val Ser Ser 
1075 1080 1085 

Gly Asp Asn Leu Glu Ala Ser Leu Pro Met lie Ser Phe Lys Gin Glu 

1090 1095 1100 

Glu Leu Glu Asn Glu Gly Leu Lys Pro Arg Glu Glu Pro Gin Ser Ala 
1105 1110 ill5 1120 

Ala Glu Gin Asp Val Val Val Gin Glu Thr Phe Asn Lys Asn Phe Val 
1125 1130 1135 

Cys Asn Val Cys Glu Ser Pro Phe Leu Ser lie Lys Asp Leu Thr Lys 
1140 1145 1150 

His Leu Ser He His Ala Glu Glu Trp Pro Phe Lys Cys Glu Phe Cys 
1155 1160 1165 

Val Gin Leu Phe Lys Asp Lya Thr Asp Leu Ser Glu His Arg Phe Leu 

1170 117S 1180 

Leu His Gly Val Gly Asn He Phe Val Cys Ser Val Cys Lys Lys Glu 
1185 1190 1195 1200 

Phe Ala Phe Leu Cys Asn Leu Gin Gin His Gin Arg Asp Leu His Pro 
1205 1210 1215 

Asp Lys Val Cys Thr His His Glu Phe Glu Ser Gly Thr Leu Arg Pro 
1220 1225 1230 

Gin Asn Phe Thr Asp Pro Ser Lys Ala His Val Glu His Met Gin Ser 
1235 1240 1245 

Leu Pro Glu Asp Pro Leu Glu Thr Ser Lys Glu Glu Glu Glu Leu Asn 

1250 1255 1260 

Asp Ser Ser Glu Glu Leu Tyr Thr Thr He Lys lie Met Ala Ser Gly 
1265 1270 1275 1280 

He Lys Thr Lys Asp Pro Asp Val Arg Leu Gly Leu Asn Gin His Tyr 
1285 1290 1295 

Pro Ser Phe Lys Pro Pro Pro Phe Gin Tyr His His Arg Asn Pro Met 
1300 1305 1310 



Gly He Gly Val Thr Ala Thr Asn Phe Thr Thr His Asn He Pro Gin 
1315 1320 1325 

Thr Phe Thr Thr Ala He Arg Cys Thr Lys Cys Gly Lys Gly Val A9p 
1330 1335 1340 

Asn Met Pro Glu Leu His Lys His He Leu Ala Cys Ala Ser Ala Ser 
1345 1350 1355 1360 

Asp Lys Lys Arg Tyr Thr Pro Lys Lys A9n Pro Val Pro Leu Lys Gin 
1365 1370 1375 

Thr Val Gin Pro Lys Asn Gly Val Val Val Leu Asp Asn Ser Gly Lys 
1380 1385 1390 

Asn Ala Phe Arg Arg Met Gly Gin Pro Lys Arg Leu Asn Phe Ser Val 
1395 1400 1405 

Glu Leu Ser Lys Met Ser Ser Asn Lys Leu Lys Leu Asn Ala Leu Lys 
1410 1415 1420 

Lys Lys Asn Gin Leu Val Gin Lys Ala He Leu Gin Lys Asn Lys Ser 
1425 1430 1435 * 1440 

Ala Lys Gin Lys Ala Asp Leu Lys Asn Ala Cys Glu Ser Ser Ser His 
1445 1450 1455 

lie Cys Pro Tyr Cys Asn Arg Glu Phe Thr Tyr He Gly Ser Leu Asn 
1460 1465 1470 

Lys His Ala Ala Phe Ser Cys Pro Lys Lys Pro Leu Ser Pro Pro Lys 
1475 1480 1485 

Lys Lys Val Ser His Ser Ser Lys Lys Gly Gly His Ser Ser Pro Ala 
1490 1495 1500 

Ser Ser Asp Lys Asn Ser Asn Ser Asn His Arg Arg Arg Thr Ala Asp 
1505 1510 1515 1520 

Ala Glu He Lys Met Gin Ser Met Gin Thr Pro Leu Gly Lys Thr Arg 
1525 1530 1535 

Ala Arg Ser Ser Gly Pro Thr Gin Val Pro Leu Pro Ser Ser Ser Phe 
1540 1S45 1S50 

Arg Ser Lys Gin Asn Val Lys Phe Ala Ala Ser Val Lys Ser Lys Lys 
1555 1560 1565 

Pro Ser Ser Ser Ser Leu Arg Asn Ser Ser Pro He Arg Met Ala Lys 
1570 1575 1S80 

He Thr His Val Glu Gly Lys Lys Pro Lys Ala Val Ala Lys Asn His 
1585 1590 1595 1600 

Ser Ala Gin Leu Ser Ser Lys Thr Ser Arg Ser Leu His Val Arg Val 
1605 1610 1615 

Gin Lys Ser Lys Ala Val Leu Gin Ser Lys Ser Thr Leu Ala Ser Lys 
1620 1625 1630 

Lys Arg Thr Asp Arg Phe Asn He Lys Ser Arg Glu Arg Ser Gly Gly 
1635 1640 1645 

Pro Val Thr Arg Ser Leu Gin Leu Ala Ala Ala Ala Asp Leu Ser Glu 
1650 1655 1660 



Asn Lys Arg Glu Asp Gly Ser Ala Lys Gin Glu Leu Lys Asp Phe Ser 
1665 1670 1675 1680 

Tyr Ser Leu Arg Leu Ala Ser Arg Cys Ser Pro Pro Ala Ala Ser Tyr 

1685 1690 1695 

Me Thr Arg Clu Tyr Arg Lys Val Lys Ala Pro Ala Ala Ala Gin Phe 

1700 1705 1710 

Gin Gly Pro Phe Phe Lys Glu 
1715 



SEQ ID NO. 100: 

(i) ffflJflpfiE: 

(A) -K«s 1 15 UK (B) %£!: MS! 

(D) «tt 

Met Pro Gly Ala Gly Leu Gly He Trp Thr Lys Arg Lys He Glu Val 
1 5 10 15 

Gly Glu Lya Phe Gly Pro Tyr Val Gly Glu Gin Arg Ser Asn Leu Lys 
20 25 30 

Asp Pro Ser Tyr Gly Trp Glu He Leu Asp Glu Phe Tyr Asn Val Lys 
35 40 45 

Phe Cys lie Asp Ala Ser Gin Pro Asp Val Gly Ser Trp Leu Lys Tyr 
50 55 60 

He Arg Phe Ala Gly Cys Tyr Asp Gin His Asn Leu Val AJa Cys Gin 
65 70 75 80 

lie Asn Asp Gin He Phe Tyr Arg Val Val Ala Asp He Ala Pro Gly 
85 90 9S 

Glu Glu Leu Leu Leu Phe Met Lys Ser Glu Asp Tyr Pro His Glu Thr 
100 105 HO 

Met Ala Pro 
115 

SEQ ID NO. 101: 
(i) JWfltffiE: 

(a) **: no mm (b) m: mm 

(D) tttt 

Asp Glu Leu His Gly Asn Val Leu He Ala Val Thr Gin He Ala Leu 
15 10 15 

Gly Arg Thr He Gly Val He Asp Lys Ala Thr Pro Asn Asp Ser Asn 
20 25 30 

Ala Leu Leu He Leu Asn Leu He Lys Glu Ala Asp Asp Gly Glu Asp 
35 40 45 

Ala Asn He Cys Met Arg Gin Glu Asp Arg Lys Thr Phe Leu Gin Thr 
50 55 60 

Ser Lys lie He Asn He Gly Glu Arg Leu Leu Leu Gin Arg Leu Ser 
65 70 75 80 

Glu Glu Glu Cys Asp Glu Glu Asp Gin Asp Asp Leu Glu Asn Leu He 
85 90 95 

Leu Leu Lys Asp Glu Asp Arg Pro Asp Ser Thr Gin Ser Cys 
100 ios no 



SEQ ID NO. 102: 

(i) mmi: 

(A) 21 (b) mm: mm 

(D) ttn, 



Glu Trp Gly Pro Val Thr Arg Ser Leu Gin Arg Ser Thr Lys Gin Glu 
1 S 10 15 

Leu Lys Asp Leu Gin 
20 

SEQ ID NO. 103: 

(i) rnrnw-. 

(A) 22 n.mm (b) mss 

(D) £fa: 

Gly Ala Glu Glu Thr Ala Ala Ala Ala Ala Ala Leu Gly Ala Leu Arg 
15 10 IS 

Leu Gly Arg Arg Ala Arg 
20 

SEQ ID NO. 104: 

(i) mm®.-. 

(A) K£: 391 M£tt (B) ££: M 

(C) $ < D) ^ 

(ii) W€ 

(A) CDS (B) 1-391 • 

GTG TAC TAC CCA AAT TTG GGG TGG ATG TGC ATT GAT GCC ACT GAT CCG 4 8 

Val Tyr Tyr Pro Asn Leu Gly Trp Met Cys He Asp Ala Thr Asp Pro 
1.5 10 15 

GAG AAG GGC AAC TGG CTC CGC TAT GTG AAC TGG GCT TGC TCA GGA GAA 96 
Glu Lys Gly Asn Trp Leu Arg Tyr Val Asn Trp Ala Cys Ser Gly Glu 
20 25 30 

GAA CAG AAT TTA TTT CCA CTG GAA ATC AAC AGA GCC ATT TAC TAT AAA 14 4 

Glu Gin Asn Leu Phe Pro Leu Glu He Asn Arg Ala He Tyr Tyr Lys 
35 40 45 

ACC 1TA AAG CCA ATC GCG CCT GGC GAG GAG CTC CTG GTC TGG TAC AAT 192 
Thr Leu Lys Pro He Ala Pro Gly Glu Glu Leu Leu Val Trp Tyr Asn 
50 55 60 



GGG GAA GAC AAC CCC GAG ATA GCA GCT GCG ATT GAG GAA GAG CGA GCC 
Gly Glu Asp Asn Pro Glu lie Ala Ala Ala He Glu Glu Glu Arg Ala 
65 70 75 80 



AGC GCC CGG AGC AAG CGG AGC TCC CCG AAG AGC CGG AGA GGG AAG AAG 
Ser Ala Arg Ser Lys Arg Ser Ser Pro Lys Ser Arg Arg Gly Lys Lys 
85 90 95 

AAA TCA GAG GAG AAT AAA AAC AAA GGC ATC AGA ACC CAG GCT GCA GCG 
Lys Ser Gin Glu Asn Lys Asn Lys Gly He Arg Thr Gin Ala Ala Ala 
100 105 HO 

CGG AAG GCG AGC GAG CTG GAC TCC ACC TCT GCA AAC ATG AGG GGC TCT 
Arg Lys Ala Ser Glu Leu Asp Ser Thr Ser Ala Asn Met Arg Gly Ser 
115 120 125 

GCA GAA G 
Ala Glu 
130 

SEQ ID NO. 105: 

(i) *M«pfE: 

(a) (b) «: umm 

(D) &tt 

(ii) #T«: 

Val Tyr Tyr Pro Asn Leu Gly Trp Met Cys He Asp Ala Thr Asp Pro 
15 10 15 

Glu Lys Gly Asn Trp Leu Arg Tyr Val Asn Trp Ala Cys Ser Gly Glu 
20 25 30 

Glu Gin Asn Leu Phe Pro Leu Glu He Asn Arg Ala He Tyr Tyr Lys 
35 40 45 

Thr Leu Lys Pro He Ala Pro Gly Glu Glu Leu Leu Val Trp Tyr Asn 
SO 55 60 

Gly Glu Asp Asn Pro Glu He Ala Ala Ala He Glu Glu Glu Arg Ala 
65 70 75 80 

Ser Ala Arg Ser Lys Arg Ser Ser Pro Lys Ser Arg Arg Gly Lys Lys 
85 90 95 

Lys Ser Gin Glu Asn Lys Asn Lys Gly lie Arg Thr Gin Ala Ala Ala 
100 105 HO 

Arg Lys Ala Ser Glu Leu Asp Ser Thr Ser Ala Asn Met Arg Gly Ser 
115 120 125 

Ala Glu 
130 

SEQ ID NO. 106: 

(i) pnmw: 

(A) KJf: 20lK$*f (B) tm 

(0 * (D) m& 

(xi) FF?iJJ£i$: SEQ ID NO. 106: 
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